<86>Oct 8 00:37:10 userdel[1150251]: delete user 'rooter' <86>Oct 8 00:37:10 userdel[1150251]: removed group 'rooter' owned by 'rooter' <86>Oct 8 00:37:10 userdel[1150251]: removed shadow group 'rooter' owned by 'rooter' <86>Oct 8 00:37:10 groupadd[1150258]: group added to /etc/group: name=rooter, GID=1796 <86>Oct 8 00:37:10 groupadd[1150258]: group added to /etc/gshadow: name=rooter <86>Oct 8 00:37:10 groupadd[1150258]: new group: name=rooter, GID=1796 <86>Oct 8 00:37:10 useradd[1150265]: new user: name=rooter, UID=1796, GID=1796, home=/root, shell=/bin/bash, from=none <86>Oct 8 00:37:10 userdel[1150276]: delete user 'builder' <86>Oct 8 00:37:10 userdel[1150276]: removed group 'builder' owned by 'builder' <86>Oct 8 00:37:10 userdel[1150276]: removed shadow group 'builder' owned by 'builder' <86>Oct 8 00:37:10 groupadd[1150283]: group added to /etc/group: name=builder, GID=1797 <86>Oct 8 00:37:10 groupadd[1150283]: group added to /etc/gshadow: name=builder <86>Oct 8 00:37:10 groupadd[1150283]: new group: name=builder, GID=1797 <86>Oct 8 00:37:10 useradd[1150289]: new user: name=builder, UID=1797, GID=1797, home=/usr/src, shell=/bin/bash, from=none /usr/src/in/srpm/rccl-2.18.6-alt0.1.src.rpm: bad symbols in the license tag: // <13>Oct 8 00:37:14 rpmi: libidn2-2.3.7-alt1 sisyphus+339505.100.1.2 1706718968 installed <13>Oct 8 00:37:14 rpmi: libnettle8-3.9.1-alt1 sisyphus+322548.100.1.2 1686176879 installed <13>Oct 8 00:37:14 rpmi: libp11-kit-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Oct 8 00:37:14 rpmi: libtasn1-4.19.0-alt3 sisyphus+327816.100.1.1 1692802615 installed <13>Oct 8 00:37:14 rpmi: libhogweed6-3.9.1-alt1 sisyphus+322548.100.1.2 1686176879 installed <13>Oct 8 00:37:14 rpmi: libgnutls30-3.8.4-alt1 sisyphus+343729.100.2.1 1711571288 installed <13>Oct 8 00:37:14 rpmi: libngtcp2.16-1.7.0-alt1 sisyphus+356415.200.1.1 1725031912 installed <13>Oct 8 00:37:14 rpmi: libngtcp2_crypto_gnutls8-1.7.0-alt1 sisyphus+356415.200.1.1 1725031912 installed <13>Oct 8 00:37:14 rpmi: cmake-modules-3.29.3-alt1 sisyphus+348648.100.2.1 1716590540 installed <13>Oct 8 00:37:14 rpmi: libuv-1.48.0-alt2 sisyphus+357579.100.1.1 1726426171 installed <13>Oct 8 00:37:14 rpmi: librhash-1.3.5-alt3 sisyphus+286141.40.2.1 1632982456 installed <13>Oct 8 00:37:14 rpmi: libjsoncpp24-1.9.4-alt2 sisyphus+346331.200.2.1 1716448551 installed <13>Oct 8 00:37:14 rpmi: libexpat-2.5.0-alt1 sisyphus+346180.200.2.1 1716349835 installed <13>Oct 8 00:37:14 rpmi: publicsuffix-list-dafsa-20240911-alt1 sisyphus+357399.100.1.1 1726160479 installed <13>Oct 8 00:37:14 rpmi: libpsl-0.21.5-alt1 sisyphus+338474.100.1.1 1705684769 installed <13>Oct 8 00:37:14 rpmi: libnghttp3.9-1.5.0-alt1 sisyphus+356415.100.1.1 1725031855 installed <13>Oct 8 00:37:14 rpmi: libnghttp2-1.63.0-alt1 sisyphus+356414.100.1.1 1725031508 installed <13>Oct 8 00:37:14 rpmi: openldap-common-2.6.8-alt1 sisyphus+351621.100.1.1 1719420449 installed <13>Oct 8 00:37:14 rpmi: libntlm-1.5-alt1 sisyphus+278100.3300.1.1 1626058899 installed <13>Oct 8 00:37:14 rpmi: libidn-1.37-alt2 sisyphus+300849.100.1.1 1653769687 installed <13>Oct 8 00:37:14 rpmi: libverto-0.3.2-alt1_1 sisyphus+321176.2200.10.2 1684803947 installed <13>Oct 8 00:37:14 rpmi: liblmdb-0.9.32-alt1 sisyphus+342426.100.1.1 1710124288 installed <13>Oct 8 00:37:14 rpmi: libkeyutils-1.6.3-alt1 sisyphus+346336.200.2.2 1716472658 installed <13>Oct 8 00:37:14 rpmi: libcom_err-1.46.4.0.5.4cda-alt1 sisyphus+283826.100.1.1 1629975345 installed <13>Oct 8 00:37:14 rpmi: libbrotlicommon-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Oct 8 00:37:14 rpmi: libbrotlidec-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Oct 8 00:37:14 rpmi: rpm-macros-cmake-3.29.1-alt1 sisyphus+344518.300.3.1 1712379787 installed <13>Oct 8 00:37:14 rpmi: rpm-macros-alternatives-0.5.2-alt2 sisyphus+315270.200.2.1 1676457367 installed <13>Oct 8 00:37:14 rpmi: alternatives-0.5.2-alt2 sisyphus+315270.200.2.1 1676457367 installed <13>Oct 8 00:37:14 rpmi: ca-certificates-2024.07.01-alt1 sisyphus+351897.100.1.1 1719826350 installed <13>Oct 8 00:37:14 rpmi: ca-trust-0.2.0-alt1 sisyphus+344843.100.1.1 1712743326 installed <13>Oct 8 00:37:14 rpmi: p11-kit-trust-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Oct 8 00:37:14 rpmi: libcrypto3-3.1.7-alt1 sisyphus+356755.100.1.1 1725388416 installed <13>Oct 8 00:37:14 rpmi: libssl3-3.1.7-alt1 sisyphus+356755.100.1.1 1725388416 installed <86>Oct 8 00:37:14 groupadd[1151825]: group added to /etc/group: name=_keytab, GID=999 <86>Oct 8 00:37:14 groupadd[1151825]: group added to /etc/gshadow: name=_keytab <86>Oct 8 00:37:14 groupadd[1151825]: new group: name=_keytab, GID=999 <13>Oct 8 00:37:14 rpmi: libkrb5-1.21.3-alt2 sisyphus+351857.100.1.1 1719735141 installed <13>Oct 8 00:37:14 rpmi: libgsasl-2.2.0-alt1 sisyphus+333173.100.1.1 1698696954 installed <86>Oct 8 00:37:14 groupadd[1151836]: group added to /etc/group: name=sasl, GID=998 <86>Oct 8 00:37:14 groupadd[1151836]: group added to /etc/gshadow: name=sasl <86>Oct 8 00:37:14 groupadd[1151836]: new group: name=sasl, GID=998 <13>Oct 8 00:37:14 rpmi: libsasl2-3-2.1.28-alt2 sisyphus+343335.100.1.1 1711112544 installed <13>Oct 8 00:37:14 rpmi: libldap2-2.6.8-alt1 sisyphus+351621.100.1.1 1719420449 installed <13>Oct 8 00:37:14 rpmi: libarchive13-3.7.5-alt2 sisyphus+358189.100.1.1 1727162763 installed <13>Oct 8 00:37:14 rpmi: libssh2-1.11.0-alt2 sisyphus+339356.100.1.1 1706593137 installed <13>Oct 8 00:37:14 rpmi: libcurl-8.10.0-alt1 sisyphus+357271.100.1.1 1726044759 installed <13>Oct 8 00:37:15 rpmi: cmake-3.29.3-alt1 sisyphus+348648.100.2.1 1716590540 installed <13>Oct 8 00:37:25 rpmi: llvm-common-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:37:25 rpmi: llvm-rocm-filesystem-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:25 rpmi: libnuma-2.0.18-alt1 sisyphus+358102.100.1.1 1727069613 installed <13>Oct 8 00:37:25 rpmi: rocm-device-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:25 rpmi: llvm18.1-filesystem-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:26 rpmi: clang18.1-support-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:26 rpmi: llvm18.1-polly-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:26 rpmi: gcc-c++-common-1.4.28-alt1 sisyphus+348678.100.1.1 1716396142 installed <13>Oct 8 00:37:26 rpmi: libstdc++13-devel-13.2.1-alt4 sisyphus+354645.100.1.1 1723060849 installed <13>Oct 8 00:37:26 rpmi: librocm-smi1-6.1.2-alt0.2 sisyphus+352428.100.1.1 1720459745 installed <13>Oct 8 00:37:26 rpmi: libpciaccess-1:0.18.1-alt1 sisyphus+343583.300.1.1 1711440789 installed <13>Oct 8 00:37:26 rpmi: libdrm-1:2.4.123-alt1 sisyphus+357330.40.3.1 1726125397 installed <13>Oct 8 00:37:26 rpmi: libhsakmt1-6.1.2-alt0.1 sisyphus+352247.600.5.1 1720254766 installed <13>Oct 8 00:37:26 rpmi: libhsa-runtime1-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Oct 8 00:37:26 rpmi: libpci-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Oct 8 00:37:26 rpmi: pciids-20240913-alt1 sisyphus+357455.100.1.1 1726250568 installed <13>Oct 8 00:37:26 rpmi: pciutils-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Oct 8 00:37:26 rpmi: libmpdec3-2.5.1-alt3 sisyphus+314490.500.5.1 1675432004 installed <13>Oct 8 00:37:26 rpmi: libgdbm-1.8.3-alt10 sisyphus+346222.200.3.2 1716468404 installed <13>Oct 8 00:37:26 rpmi: libb2-0.98.1-alt1_1 sisyphus+291614.100.1.1 1638962877 installed <13>Oct 8 00:37:26 rpmi: python3-3.12.7-alt1 sisyphus+358796.100.1.1 1727844808 installed <13>Oct 8 00:37:27 rpmi: python3-base-3.12.7-alt1 sisyphus+358796.100.1.1 1727844808 installed <13>Oct 8 00:37:27 rpmi: clang-rocm-libs-support-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:30 rpmi: clang-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:30 rpmi: rocminfo-6.1.2-alt0.1 sisyphus+352247.1700.9.1 1720269882 installed <13>Oct 8 00:37:30 rpmi: libedit3-3.1.20230828-alt1 sisyphus+330914.200.3.1 1696922743 installed <13>Oct 8 00:37:30 rpmi: llvm18.1-gold-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:31 rpmi: llvm18.1-libs-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:32 rpmi: libclang-cpp18-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:32 rpmi: clang18.1-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:32 rpmi: clang-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:37:33 rpmi: clang-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:34 rpmi: llvm18.1-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:34 rpmi: llvm-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:37:47 rpmi: llvm-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:47 rpmi: libclang18-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:47 rpmi: clang18.1-devel-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:47 rpmi: clang-devel-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:37:48 rpmi: clang18.1-tools-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:48 rpmi: clang-tools-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:37:54 rpmi: clang-rocm-tools-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:54 rpmi: lld18.1-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:37:54 rpmi: lld-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:37:55 rpmi: lld-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:57 rpmi: libamd_comgr2-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:57 rpmi: llvm-rocm-gold-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:58 rpmi: llvm-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:37:59 rpmi: hip-runtime-amd-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Oct 8 00:37:59 rpmi: hipcc-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:38:01 rpmi: mlir18.1-tools-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:38:18 rpmi: llvm18.1-devel-18.1.8-alt0.2 sisyphus+357910.700.19.1 1728048814 installed <13>Oct 8 00:38:18 rpmi: llvm-devel-18.1.0-alt2 sisyphus+357910.2500.18.1 1728040850 installed <13>Oct 8 00:38:30 rpmi: llvm-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:38:30 rpmi: hip-devel-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Oct 8 00:38:30 rpmi: rocm-comgr-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:38:39 rpmi: clang-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Oct 8 00:38:40 rpmi: hipify-clang-6.1.2-alt0.1 sisyphus+352428.200.1.1 1720459887 installed <13>Oct 8 00:38:40 rpmi: hsa-rocr-devel-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Oct 8 00:38:40 rpmi: librocm-smi-devel-6.1.2-alt0.2 sisyphus+352428.100.1.1 1720459745 installed <13>Oct 8 00:38:40 rpmi: libstdc++-devel-13-alt1 sisyphus+323337.300.1.1 1687267966 installed <13>Oct 8 00:38:40 rpmi: rocm-cmake-6.1.2-alt0.1 sisyphus+352247.100.1.1 1720180839 installed Building target platforms: x86_64 Building for target x86_64 Wrote: /usr/src/in/nosrpm/rccl-2.18.6-alt0.1.nosrc.rpm (w1.gzdio) Installing rccl-2.18.6-alt0.1.src.rpm Building target platforms: x86_64 Building for target x86_64 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.57793 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + rm -rf rccl-2.18.6 + echo 'Source #0 (rccl-2.18.6.tar):' Source #0 (rccl-2.18.6.tar): + /bin/tar -xf /usr/src/RPM/SOURCES/rccl-2.18.6.tar + cd rccl-2.18.6 + /bin/chmod -c -Rf u+rwX,go-w . + subst 's,cat ${ROCM_PATH}/.info/version,echo 6.1.2,' CMakeLists.txt + exit 0 Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.57793 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + export ALTWRAP_LLVM_VERSION=rocm + ALTWRAP_LLVM_VERSION=rocm + mkdir -p x86_64-alt-linux + cmake -DCMAKE_SKIP_INSTALL_RPATH:BOOL=yes '-DCMAKE_C_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_CXX_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_Fortran_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' -DCMAKE_INSTALL_PREFIX=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_DESTINATION=lib64 -DLIB_SUFFIX=64 -S . -B x86_64-alt-linux -Wno-dev -DROCM_PATH=/usr -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_INSTALL_LIBDIR=lib64 -DENABLE_MSCCL_KERNEL=ON -- The CXX compiler identification is Clang 17.0.0 -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/clang++ - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- Checking for ROCm support for GPU targets: -- Performing Test COMPILER_HAS_TARGET_ID_gfx803 -- Performing Test COMPILER_HAS_TARGET_ID_gfx803 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx908_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx908_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on -- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx940 -- Performing Test COMPILER_HAS_TARGET_ID_gfx940 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx941 -- Performing Test COMPILER_HAS_TARGET_ID_gfx941 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 -- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 -- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success -- Compiling for gfx803;gfx900:xnack-;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack-;gfx90a:xnack+;gfx940;gfx941;gfx942;gfx1030;gfx1100;gfx1101;gfx1102 -- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11") -- ROCM_PATH found: /usr -- Performing Test CMAKE_HAVE_LIBC_PTHREAD -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success -- Found Threads: TRUE -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS -- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success -- HIP compiler: clang -- HIP runtime: rocclr -- hipcc executable: /usr/bin/hipcc -- hipcc version: 6.1.40093 -- ROCm version: 6.1.2 ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:79 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:80 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipEventDisableSystemFence -- Looking for hipEventDisableSystemFence - not found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:84 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:145 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:79 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:80 (string) /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- Looking for hipDeviceMallocUncached -- Looking for hipDeviceMallocUncached - not found ******************************************************************************* *------------------------------- ROCMChecks WARNING --------------------------* Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='. ROCMChecks now calling: CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message): 'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below: Call Stack (most recent call first): /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var) /usr/share/cmake/Modules/CheckSymbolExists.cmake:84 (set) /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS) CMakeLists.txt:148 (check_symbol_exists) *-----------------------------------------------------------------------------* ******************************************************************************* -- HSA runtime: /usr/include -- Found rocm_smi at /usr/include -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h -- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found -- Performing Test HAVE_KERNARG_PRELOAD -- Performing Test HAVE_KERNARG_PRELOAD - Success -- Kernarg preloading to SGPR enabled -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.h -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp -- HIP_UNCACHED_MEMORY enabled -- RCCL LL128 protocol enabled -- Building shared RCCL library -- rocm-cmake: Set license file to /usr/src/RPM/BUILD/rccl-2.18.6/LICENSE.txt. -- Configuring done (14.7s) -- Generating done (0.0s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_C_COMPILER CMAKE_C_FLAGS CMAKE_Fortran_FLAGS LIB_DESTINATION LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux + cmake --build x86_64-alt-linux --verbose --parallel 16 Change Dir: '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j16 gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -S/usr/src/RPM/BUILD/rccl-2.18.6 -B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux --check-build-system CMakeFiles/Makefile.cmake 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux//CMakeFiles/progress.marks gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/Makefile2 all /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /usr/src/RPM/BUILD/rccl-2.18.6/cmake/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Built target git_version_check gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/collectives/all_reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/collectives/all_to_all.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_all.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_to_allv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_allv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/channel.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/channel.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/alltoall_pivot.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/alltoall_pivot.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/broadcast.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/broadcast.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/onerank_reduce.cu -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/onerank_reduce.cu -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/broadcast.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/broadcast.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/transport/shm.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/shm.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/all_gather.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_gather.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/common_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/bootstrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/bootstrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/device/all_reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/common.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/msccl_kernel_impl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/msccl_kernel_impl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/op128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/op128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/primitives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/primitives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/reduce_scatter.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_scatter.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/msccl.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/msccl.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce_scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce_scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/sendrecv.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/sendrecv.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/collectives/sendrecv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/sendrecv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_simple.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_simple.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/debug.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/debug.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/graph/connect.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/connect.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rings.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/enqueue.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/enqueue.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rings.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/rome_models.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/trees.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/trees.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/paths.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/paths.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/xml.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/BfdBacktrace.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/BfdBacktrace.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/group.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/group.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/tuning.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/tuning.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/xml.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/rome_models.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/search.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/search.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/align.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/align.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/archinfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/archinfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/alloc.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/alloc.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/bootstrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/bootstrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/argcheck.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/argcheck.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/channel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/channel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/checks.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/checks.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/coll_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/coll_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/core.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/core.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/cpuset.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/cpuset.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/debug.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/debug.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/comm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/comm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/enqueue.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/enqueue.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/gdrwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/gdrwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/devcomm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/devcomm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/collectives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/collectives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/git_version.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/git_version.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/graph.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/graph.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/group.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/group.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvsymbols.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvsymbols.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/info.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/info.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/ibvcore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvcore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ipcsocket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ipcsocket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_lifecycle.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_lifecycle.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_parser.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_parser.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_scheduler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_scheduler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_setup.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_setup.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/msccl/msccl_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_status.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_status.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/nccl_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nccl_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_event.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_event.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvmlwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvmlwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCuda.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCudaRt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtOpenCL.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 24%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtSync.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtPayload.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtx3.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx_stub.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx_stub.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/p2p.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/p2p.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/param.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/param.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/profiler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/profiler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/proxy.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/proxy.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_bfloat16.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_bfloat16.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_vars.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_vars.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rocm_smi_wrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocm_smi_wrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/rocmwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocmwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/shm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/shm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/signals.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/signals.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/socket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/socket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/strongstream.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/strongstream.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/timer.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/timer.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/transport.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/transport.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/trees.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/trees.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/archinfo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/archinfo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/argcheck.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/argcheck.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/include/utils.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/utils.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvsymbols.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvsymbols.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ipcsocket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ipcsocket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_status.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_status.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/npkit.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/npkit.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_lifecycle.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/init.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/init.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_parser.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_parser.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_setup.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_setup.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/nvmlwrap_stub.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/nvmlwrap_stub.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/param.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/param.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/profiler.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/profiler.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocm_smi_wrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocm_smi_wrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/signals.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/signals.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocmwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocmwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/shmutils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/shmutils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/strongstream.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/strongstream.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/utils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/utils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/nvls.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/nvls.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 41%] Hipifying src/transport/p2p.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/p2p.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/coll_net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/coll_net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/proxy.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/proxy.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_ib.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_ib.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | sIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ tatic long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ 3 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 3 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for host. 4 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx1102. 5 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx1101. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx941. 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx940. 5 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for host. 5 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:In file included from 21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc :22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60 : 233In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.hs:t14a: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.hi:c40 :n13c:c lwarning: Runused function 'log2i' [-Wunused-function]e sult_t x m40l | GsettaStuibcK vlIonntg( sltorgu2cit( lnocncgl Xnm)l N{o d e| * ^~~~~ node, const chIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.ccr:*24 : s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hu:b206N:a21m:e ,warning: unused function 'ncclTopoRankToIndex' [-Wunused-function]s truct 206n | csctlaXtmilcN ondcec*l*R essuubl,t _cto nnsctc lcThoapro*R aantktTroNIanmdee,x (csotnrsutc ti nntc caltTtorpVoaSlyuset)e m{* s| y ^~~~~~~~~~~~~~s te/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hm:,240 :i21n:t warning: runused function 'xmlAddNode' [-Wunused-function]a nk, int *240 | isntdaetxi)c {n c c| l ^~~~~~~~~~~~~~~~~~~R esu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.hl:t217_:t21 :x mwarning: lunused function 'ncclTopoDevToRank' [-Wunused-function]A ddNo d217e | (ssttartuicct nnccccllRXemslu*l tx_mtl ,n csctlrTuocpto DnecvcTloXRmalnNko(dset*r upcatr ennctc,l TcoopnosSty sctheamr** ssyusbtNeamm,e ,i nstt rduecvt, nicnctl*X mrlaNnokd)e *{* s| u ^~~~~~~~~~~~~~~~~b ) { In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc ^~~~~~~~~~: 25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h::94256::2121:: warning: warning: unused function 'xmlGetAttrInt' [-Wunused-function]unused function 'xmlRemoveNode' [-Wunused-function] 25694 | | ssttaattiicc nnccccllRReessuulltt__tt xmxlmRleGmeotvAetNtordIen(ts(tsrturcutc tn cncclcXlmXlmNloNdeo*d en*o dneo)d e{, c| o ^~~~~~~~~~~~~n st /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hc:h276a:r21*: awarning: tunused function 'kvConvertToInt' [-Wunused-function]t rName ,276 | isntta*t ivca lnucec)l R{e s u| l ^~~~~~~~~~~~~t _t k/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hv:C101o:n21v:e rwarning: tunused function 'xmlGetAttrIntDefault' [-Wunused-function]T oInt(con s101t | scthaatri*c sntcrc,l Rienstu*l tv_atl uxem,l GsettrAutcttr IknvtDDiecfta*u ldti(cstt)r u{c t | n ^~~~~~~~~~~~~~c cl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hX:m289l:N21o:d ewarning: *unused function 'kvConvertToStr' [-Wunused-function] node ,289 | csotnastti cc hnacrc*l RaetsturlNta_mte ,k viCnotn*v evratlTuoeS,t ri(nitn td evfaaluulet,V aclounes)t {c h a| r ^~~~~~~~~~~~~~~~~~~~* * s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.ht:r109,: 21s: twarning: runused function 'xmlGetAttrFloat' [-Wunused-function]u ct k v109D | isctta*t idci cntc)c l{R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc e: 1364s| :u ^~~~~~~~~~~~~~l15 t:_ twarning: unused variable 'ringRemap' [-Wunused-variable]x mlGet A1364t | t r Fsltoaatti(cs tcrhuacrt rnicncglRXemmlaNpo[d2e5*6 ]n;o d e| , ^~~~~~~~~ con/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.ccs:t1385 :c7h:a rwarning: *variable 'gcnt' set but not used [-Wunused-but-set-variable] at t1385r | N a mien,t fglconatt *= v0a;l u e| ) ^ { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc :| 1460 ^~~~~~~~~~~~~~~: 9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hwarning: :unused variable 't' [-Wunused-variable]116 :21: warning: 1460unused function 'xmlFindTag' [-Wunused-function] | fl o116a | ts tta t=i c( tnvcec.ltRve_ssuelct _-t txvmsl.Ftivn_dsTeacg)(*s1tEr3u c+t (ntcvcel.Xtmvl_*u sxemcl ,- ctovnss.tt vc_huasre*c )t/a1gEN3a;m e ,| ^s truct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ ) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ TagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx1102. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx941. 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.ccn:s1364t: 15c:h awarning: runused variable 'ringRemap' [-Wunused-variable]* att r1364N | a m es,t actoincs tc hcahra rr*i nvgaRleumea)p [{2 5 6| ] ^~~~~~~~~~; | ^~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h :157:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc21::1385 :warning: 7unused function 'xmlSetAttrIfUnset' [-Wunused-function]: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385157 | | s tiantti cg cnnctc l=R e0s;u l t| _ ^t x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.ccm:l1460S:e9t:A twarning: tunused variable 't' [-Wunused-variable]r IfUns e1460t | ( s tfrluocatt ntc c=l X(mtlvNeo.dtev*_ sneocd e-, tcvosn.sttv _csheacr)** 1aEt3t r+N a(mtev,e .ctovn_suts ecch a-r *t vvsa.ltuve_)u s{e c )| / ^~~~~~~~~~~~~~~~~1 E3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h;: 169 :| 21 ^: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx942. 28 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ clChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx90a. 20 warnings generated when compiling for gfx1100. 20 warnings generated when compiling for gfx941. 20 warnings generated when compiling for gfx1101. 20 warnings generated when compiling for gfx803. 20 warnings generated when compiling for gfx1030. 20 warnings generated when compiling for gfx90a. 20 warnings generated when compiling for gfx940. 20 warnings generated when compiling for gfx906. 20 warnings generated when compiling for gfx908. 20 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx942. 20 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx941. 2 warnings generated when compiling for gfx1102. 2 warnings generated when compiling for gfx908. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx900. 2 warnings generated when compiling for gfx803. 2 warnings generated when compiling for gfx940. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. 2 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 8 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nrank/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ s, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclColIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMlhanNdelte-,> rveogiMdr*( croelclvCMohmamn,d ldea,t a ,v osiidz*e*, rteyqpuee,s tm)h a{n d l| e ^~~~~~~~~~~~~~~~~) ); return ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hu:c28c:e21s:s ;warning: unused function 'collNetIflush' [-Wunused-function]} | ^~~~~~~~~~~~ 28 | sta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ht:i24c: 21n:c cwarning: lunused function 'collNetRegMrDmaBuf' [-Wunused-function]R esult_t collN e24t | Isftlautsihc( sntcrculcRte snuclctl_Cto mcmo*l lcNoemtmR,e gvMoriDdm*a Bcuofl(lsCtormumc,t vnocicdl*C odmamt*a ,c oimnmt, sviozied,* vcooildl*C ommhma,n dvloei,d *v odiadt*a*, rienqtu essitz)e ,{ iNnCtC LtCyHpEeC,K (ucionmtm6-4>_ntc colfCfosleltN,e ti-n>ti ffldu,s hv(ociodl*l*C ommhma,n ddlaet)a ,{ sNiCzCeL,C HmEhCaKn(dcloem,m -r>enqcucelsCto)l)l;N erte-t>urreng MnrcDcmlaSBuucfc(ecsosl;l C}o m m| , ^~~~~~~~~~~~~ data/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h,: 29s:i21z:e ,warning: unused function 'collNetTest' [-Wunused-function]t ype, off s29e | ts,t aftdi,c mnhcacnldRlees)u)l;t _rte tcuorlnl NnectcTleSsutc(csetsrsu;c t} n c| c ^~~~~~~~~~~~~~~~~~l Comm*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h :c25o:m21m:, warning: vunused function 'collNetDeregMr' [-Wunused-function]o id* requ e25s | ts,t aitnitc* ndcocnleR,e siunltt*_ ts iczoel)l N{e tNDCeCrLeCgHMErC(Ks(tcroumcmt- >nnccccllCCoomlml*N ecto-m>mt,e svto(irde*q uceosltl,C odmomn,e ,v osiidz*e )m)h;a nrdelteu)r n{ nNcCcClLSCuHcEcCeKs(sc;o m}m - >| n ^~~~~~~~~~~c clC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ho:l30l:N21e:t -warning: >unused function 'collNetCloseColl' [-Wunused-function]d eregMr( c30o | lsltCaotmimc, nmchcalnRdelseu)l)t;_ tr ectoulrlnN entcCclloSsueColl(strcuccets sn;c c}l C o| m ^~~~~~~~~~~~~~m * c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ho:m26m:,21 :v owarning: iunused function 'collNetIallreduce' [-Wunused-function]d * collComm )26 | {s tNaCtCiLcC HnEcCcKl(Rceosmuml-t>_ntc ccloClollNleNteIta-l>lcrleodsuecCeo(lslt(rcuocltl Cnocmcml)C)o;m mr*e tcuormnm ,n cvcoliSdu*c cceoslsl;C o}m m ,| ^~~~~~~~~~~~~~~~v oid*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h :s31e:n21d:D awarning: tunused function 'collNetCloseListen' [-Wunused-function]a , void* r31e | csvtDaattiac, nicnctl Rceosuunltt,_ tn cccollDlaNteatTCylpoes_etL idsatteanT(yspter,u cntc cnlcRceldCOopm_mt* rceodmOmp,, vvooiidd** lsiesntdeMnhCaonmdml)e ,{ vNoCiCdL*C HrEeCcKv(Mchoamnmd-l>en,c c lvCooildl*N*e tr-e>qculeosste)L i{s t e| n ^~~~~~~~~~~~~~~~~( list/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.he:n28C:o21m:m )warning: )unused function 'collNetIflush' [-Wunused-function]; return nc c28l | Ssutcacteiscs ;n c}c l R| e ^~~~~~~~~~~~~~~~~~s ult_t coIn file included from l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccl:N17e: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hI:f128l:u21s:h (warning: sunused function 'xmlFindTagKv' [-Wunused-function]t ruct ncc l128C | osmtma*t icco mnmc,c lvRoeisdu*l tc_otl lxCmolmFmi,n dvToaigdK*v (dsattrau,c ti nntc csliXzmel,* vxomild,* cmohnasntd lceh,a rv*o itda*g*N armeeq,u essttr)u c{t NnCcCcLlCXHmElCNKo(dceo*m*m -n>ondcec,l CcoolnlsNte tc-h>airf*l uastht(rcNoalmleC,o mcmo,n sdta tcah,a rs*i zaet,t rmVhaalnudel)e ,{ r e| q ^~~~~~~~~~~~u est))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h;: 144r:e21t:u rwarning: nunused function 'xmlSetAttr' [-Wunused-function] ncclSucc e144s | ss;t a}t i c| ^~~~~~~~~~~~~n cclRes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hu:l29t:_21t: xwarning: munused function 'collNetTest' [-Wunused-function]l SetAttr(st r29u | cstt antciccl XnmclcNloRdees*u lnto_dte ,c oclolnNsett Tcehsatr(*s tartutcrtN anmcec,l Ccoomnms*t ccohmamr,* vvoaildu*e )r e{q u e| s ^~~~~~~~~~t , int/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h*: 157d:o21n:e ,warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function]i nt* size )157 | {s tNaCtCiLcC HnEcCcKl(Rceosmuml-t>_ntc cxlmCloSleltNAettt-r>ItfeUsnts(erte(qsutersutc,t dnocncel,X msliNzoed)e)*; nroedteu,r nc onncsctl Scuhcacre*s sa;t t}r N a| m ^~~~~~~~~~~e , con/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hs:t30 :c21h:a rwarning: *unused function 'collNetCloseColl' [-Wunused-function] value) { 30 | | s ^~~~~~~~~~~~~~~~~t atic/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h :n182c:c21l:R ewarning: sunused function 'xmlSetAttrFloat' [-Wunused-function]u lt_t c o182l | lsNteattCilco snecCcollRle(ssutlrtu_ctt xnmclcSleCtoAmtmt*r Fcloomamt,( svtoriudc*t cnoclcllCXommlmN)o d{e *N CnCoLdCeH,E CcKo(ncsotm mc-h>anrc*c laCtotlrlNNaemte-,> ccloonssetC oflllo(acto lvlaCloumem)) ){; r| e ^~~~~~~~~~~~~~~t urn /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hn:c195c:l21S:u cwarning: cunused function 'xmlUnsetAttr' [-Wunused-function]e ss; } 195 | | s ^~~~~~~~~~~~~~~~t atic /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hn:c31c:l21R:e swarning: uunused function 'collNetCloseListen' [-Wunused-function]l t_t xmlU n31s | esttAatttirc( sntcrculcRte snuclctl_Xtm lcNooldleN*e tnColdoes,e Lciosntsetn (cshtarru*c ta tntcrcNlaCmoem)m *{ c o| m ^~~~~~~~~~~~m , v/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.ho:i233d:*21 :l iwarning: sunused function 'xmlGetSubKvInt' [-Wunused-function]t enComm )233 | {s tNaCtCiLcC HnEcCcKl(Rceosmuml-t>_ntc cxlmCloGleltNSeutb-K>vcIlnots(esLtirsutcetn (nlcicsltXemnlCNoomdme)*) ;n ordeet,u rcno nnsctc lcShuacrc*e sssu;b N}a m e| , ^~~~~~~~~~~~~~~~~~ struct nIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccc:l17X: m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hl:N128o:d21e:* *warning: unused function 'xmlFindTagKv' [-Wunused-function]s ub, const c128h | asrt*a taitct rnNcacmleR,e scuolnts_tt ixnmtl FaitntdrTVaaglKuve()s t{r u c| t ^~~~~~~~~~~~~~ nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hX:m256l:*21 :x mwarning: lunused function 'xmlRemoveNode' [-Wunused-function], con s256t | scthaatri*c tnacgcNlaRmees,u lstt_rtu cxtm lnRcecmloXvmelNNooddee(*s*t rnuocdte ,n cccolnXsmtl Ncohdaer** naotdter)N a{m e ,| ^~~~~~~~~~~~~c ons/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.ht: 289c:h21a:r *warning: unused function 'kvConvertToStr' [-Wunused-function]a ttrV a289l | uset)a t{i c | n ^~~~~~~~~~~~c clRes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.hu:l144t:_21t: kwarning: vunused function 'xmlSetAttr' [-Wunused-function]C onvert T144o | Ssttra(tiinct nvcaclluRee,s uclotn_stt xcmhlaSre*t*A tsttrr(,s tsrturcutc tn ckcvlDXimcltN*o ddei*c tn)o d{e , | c ^~~~~~~~~~~~~~o nst char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, vo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ccid* col:l865C:o19m:m ,warning: variable 'cId' set but not used [-Wunused-but-set-variable]v oid* sendData, vo i865d | * rienctv DgaItnad,e xi n=t 0c,o ucnItd, =n c0c,l Dna t=a T0y;p e _| t ^ dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ Result_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResultIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] _t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx1102. 23 warnings generated when compiling for gfx908. 23 warnings generated when compiling for gfx900. 23 warnings generated when compiling for gfx803. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx906. 23 warnings generated when compiling for gfx1101. 23 warnings generated when compiling for gfx941. 23 warnings generated when compiling for gfx1100. 23 warnings generated when compiling for gfx940. 23 warnings generated when compiling for gfx1030. 23 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx942. 23 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx803. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx900. 10 warnings generated when compiling for gfx908. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx941. 10 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ 45 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(coll/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ Comm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc ^~~~~~~~~~~~: 1995:26/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:: 24warning: :unused variable 'payload' [-Wunused-variable]21 : warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 1995 | Nvt x24P | asrtaamtsiCco mnmcIcnliRteRsaunlkt _pta ycloolaldN{emtyRreagnMkr,D mnarBaunfk(ss,t rcuucdta Dnecvc}l;C o m| m ^~~~~~~* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc | :s2009t:a38t:i cwarning: unused variable 'CommInitAllSchema' [-Wunused-variable]n cclRe s2009u | l t _cto nksvtCeoxnpvre rntvTtoxIPnaty(lcooandsStc hcehmaarE*n tsrtyr_,t iCnotm*m IvnailtuAel,l Sscthreumcat[ ]k v=D i{c t *| ^~~~~~~~~~~~~~~~~d ict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 45 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrInt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ Default(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 45 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 45 warnings generated when compiling for gfx90a. 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx90a. 45 warnings generated when compiling for gfx940. 45 warnings generated when compiling for gfx900. 45 warnings generated when compiling for gfx1100. 45 warnings generated when compiling for gfx803. 45 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSucces/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ s; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx942. 45 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :i294n:t5*: rwarning: avariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]n k) {294 | | ^~~~~~~~~~~~~~~~~ def/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.ha:u229l:t14:: warning: | unused function 'ncclTopoXGMISpeed' [-Wunused-function] ^~~~~~~ 229/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc | :s32t:a1t:i cnote: in instantiation of function template specialization 'ncclKernel' requested heref lo a32t | InMcPcLl_TMoApIoNX_GKMEIRSNp(e)e;d ( c| o^n st c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:a373r:*3 :g cnote: nexpanded from macro 'IMPL_MAIN_KERN') { | 373 ^~~~~~~~~~~~~~~~~ | ncclKernIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.ccl:<14t: r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.hu:e161>:(14c:o mwarning: munused function 'ncclGdrInit' [-Wunused-function], cha n161n | esltMaatsikc, gwdorr_ktH enacdc)l;G d\r I n| i ^t () {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 298| : ^~~~~~~~~~~34 : note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206: 21298: | warning: unused function 'ncclGdrCudaFree' [-Wunused-function] cop y206T | osSthamteimc1 6ncc(ltRieds%uWlAtR_Pt_ SnIcZcEl,G ddrsCtu,d asFrrce,e (bvyotieds* gdr)H;a n d| l ^~~e ) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr CLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx941. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx900. 28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for host. 28 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for host. 4 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* systIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ em, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx900. 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx940. 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx1102. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx1100. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx942. 5 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* nodunused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ e, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx942. 17 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return nIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_tcclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collCoIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redmOmp),) ;v oriedt*u rsne nndcMchlaSnudclcee,s sv;o i}d * | r ^~~~~~~~~~~~~~~~e cvMhandle, void*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h*: 31r:e21q:u ewarning: sunused function 'collNetCloseListen' [-Wunused-function]t ) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h :3128 | :s21t:a twarning: iunused function 'collNetIflush' [-Wunused-function]c ncclResul t28_ | ts tcaotlilcN entcCclloRseesLuilstt_etn (csotlrluNcett InfclculsCho(msmt*r uccotm mn,c cvloCiodm*m *l icsotmemn,C ovmomi)d *{ cNoClClLCCoHmEmC,K (vcooimdm*- >dnactcal,C oilnltN esti-z>ec,l ovsoeiLdi*s tmehna(nldilset,e nvCooimdm*)*) ;r erqeuteusrtn) n{c cNlCSCuLcCcHeEsCsK;( c}o m m| - ^~~~~~~~~~~~~~~~~~> ncclC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.ho:l33l:N12e:t -warning: >unused function 'collNetSupport' [-Wunused-function]i flush(c o33l | lsCtoamtmi,c diantta ,c oslilzNee,t Smuhpapnodrlte(,s trreuqcute sntc)c)l;C ormemt*u rcno mnmc)c l{S urcecteusrsn; c}o m m| - ^~~~~~~~~~~~~> nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.hc:l29C:o21l:l Nwarning: eunused function 'collNetTest' [-Wunused-function]t != nul l29p | tsrt a?t i1c :n c0c;l R}e s u| l ^~~~~~~~~~~~~~t _t collNetTest(In file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cct:r12u: c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.ht: 161n:c14c:l Cwarning: ounused function 'ncclGdrInit' [-Wunused-function]m m* com m161, | svtoaitdi*c rgedqru_ets tn,c cilnGtd*r Idnointe(,) i{n t *| ^~~~~~~~~~~s ize) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.ccN:C196C:L21C:H Ewarning: Cunused function 'collNetDumpMap' [-Wunused-function]K (comm -196> | nsctcaltCiocl lnNcectl-R>etseusltt(_rte qcuoelsltN,e tdDounmep,M aspi(zset)r)u;c tr ectounrnne cntcMcalpS*u cmcaeps)s ;{ } | ^~~~~~~~~~~~~~| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx803. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx940. 21 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx941. 21 warnings generated when compiling for gfx900. 21 warnings generated when compiling for gfx1030. 21 warnings generated when compiling for gfx906. 21 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx803. 11 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx90a. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx940. 11 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 25 warnings generated when compiling for gfx90a. 25 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ id%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ c##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffsIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ et = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll1In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 28Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ ARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizeIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ s[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(arg runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ s); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx7. warningxs/ generatedW when compiling for Agfx1102R. P_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | In file included from in/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cppt: 1o: fIn file included from f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:e10t: In file included from =/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :t169i: d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h;: 271 :| 19 ^: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t daIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ ta1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ :21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hIn file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp10:: 1In file included from : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hIn file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h168:: 10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h: :In file included from 153/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h::14169:: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hwarning: :unused variable 'data1' [-Wunused-variable]271 :19: warning: unused variable 'ptr' [-Wunused-variable] 153 | 271u | i n t 3 2 _ t duaitnat16,4 _ftl*a gp1t,r d=a trae2c,v Pftlra(g02);+ l l| 1 ^~~~~2 8Off/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hs:e153t:;21 : | warning: ^~~unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ , data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint3In file included from 2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp_:t1 : dIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:a101: ,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hf:l169a: g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h1:,271 :d19a:t awarning: 2unused variable 'ptr' [-Wunused-variable], flag2; 271 | | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153 :u35i:n twarning: 6unused variable 'flag2' [-Wunused-variable]4 _t* p153t | r = rueicnvtP3t2r_(t0 )d+altla112,8 Offlfasge1t,; d a| t ^~~a 2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h;: 514 :| 9 ^: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##alIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | rungo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ Ring(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ symmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads)In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hPLE]/N:C562C:L15_:S Twarning: Einitializer order does not match the declaration order [-Wreorder-ctor]P S/sizeof(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n275t:h90r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (nthr e275a | d s ) , t iPdrIinmBiltoicvke(sts,t e/p*SDiizree(cntc=c*l/S0h,m ePmr.octoom,m .0b>u fpfrSiimzse s [| N ^C CL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:T595O:_5S:I Mnote: Pin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL E]/ N595C | C L _ S TrEuPnST/rseiezUepoDfo(wTn)<)T ,{ R e| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O p ,| group(groupP rotoSimple<1, 1>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>:(275a:r90g:s )note: ;in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53P:r inote: min instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei tive s202< | T , R e d O p ,R uFnaWnoArskyEmlmeemternitc<o,t o/>*(D)i.rreucnt(=w*e/)0;, P| r ^o to, 0> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppp:r6i:m1s: note: | in instantiation of member function 'RunWork, 0, 2>::run' requested here ^ 6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | :I595M:P5L:_ Cnote: Oin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL L_FU N595C | ( A l l RreudnuTcree,e UTpRDEoEw,n >(args); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202562 | | t i d (RtuindW)o,r knEtlhermeeandts<(Fnnt,h rTe,a dRse)d,O pt,i dAIlngBol,o cPkr(otthor>e(a)d.Irduxn.(xw)e,) ;g r o| u ^p (group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~5 : 1| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 563 | 5 | IsMtPeLp_SCiOzLeL(_nFcUcNlCS(hAmlelmR.ecdoumcme.,b uTfRfESEi,z eSsI[MNPCLCEL,_ PSRuOmTPOo_sStIDMiPvL,E ]u/iNnCtC8L__tS)T E P| S^/ sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:T391):)95 :{ note: expanded from macro 'IMPL_COLL_FUNC'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h<:n324c:c90l:F unote: nin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec ##fu n324c | , t y p e ,P rFiumnict#i#vdeesvF,a nNACsCyLm_mAeLtGrOi_c#<#1a,l gNoC,C LN_CMCALX__PDREOVT_OA_R#I#TpYr>o,t o/>*(D)i.rreucnt(=&*n/c0c,l SPhrmoetmo.,w o0r>k )p;r i\m s | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::595562::515:: note: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 595 | 562 | r u ntTirde(etUipdD)o,w nnc>k((atrhgrse)a;d I d| x ^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:(53g:r onote: uin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herep ), 202| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k Eleme n562t | < F n , tTi,d (RteiddO)p,, nAtlhgroe,a dPsr(onttoh>r(e)a.drsu)n,( wtei)d;I n B| l ^o ck(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppa:d5I:d1x:. xnote: )in instantiation of member function 'RunWork, 0, 2>::run' requested here, gro u5p | (IgMrPoLu_pC)O,L L _| F ^~~~~~~~~~~U NC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ iv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPos/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:Di562v:,15 :i nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]8 _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 391 : 95 : tnote: iexpanded from macro 'IMPL_COLL_FUNC'd (tid), 391n | t h rReuandWso(rnkto,u pN)C,C L _| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L G O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #algo ,563 | N C C L _sPtReOpTSOi_z#e#(pnrcoctloS>h(m)e.mr.ucno(m&mn.cbculfSfhSmiezme.sw[oNrCkC)L;_ P\R O T| O ^_ SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:E562]:/15N:C Cnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ STEPS /562s | i z e o ft(iTd)()t i{d ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s324):,90 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI nBlock (324t | h r e a d I dPxr.ixm)i,t igvreosu, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ o>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,g rNoCuCpL(_gArLoGuOp_)#,# a l| g ^~~~~~~~~~~~~~~~~o , NCCL_PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:p562r:o60t:o >note: (field 'group' will be initialized after field 'stepSize') .run(&nc c562l | S h m e mt.iwdo(rtki)d;) ,\ n t| h ^r eads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock') , tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~d InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here _FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREEment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx908. 19 warnings generated when compiling for gfx940. 19 warnings generated when compiling for gfx941. 19 warnings generated when compiling for gfx90a. 19 warnings generated when compiling for gfx90a. 19 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 19 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for host. 19 warnings generated when compiling for gfx906. 19 warnings generated when compiling for gfx1100. 19 warnings generated when compiling for gfx900. 19 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffse/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ht: 514=: 9W:i rwarning: evariable 'offset' set but not used [-Wunused-but-set-variable]W ord P514e | r S l ice*warp + 2*wid; | ^ int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hPROTO_:#562#:p15r:o twarning: oinitializer order does not match the declaration order [-Wreorder-ctor]> ().run(&ncclS h562m | e m . w otrikd)(;t i\d ) ,| ^n threads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's ), ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I n B| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o ck(t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~~~~~~~. buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60s:[ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PRO T562O | _ S I M PtLiEd](/tNiCdC)L,_ SnTtEhPrSe/asdisz(enotfh(rTe)a)d s{) , | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d I| n group(groupB lock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)275,: 90g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (grou p275) | , | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlockE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groupm), | ^~~~~~~~~~~~~~~~~m .buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:S562i:z60e:s [note: Nfield 'group' will be initialized after field 'stepSize'C CL_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p275(:g90r:o unote: pin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , | ^~~~~~~~~~~ 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562o:t15o:> (warning: )initializer order does not match the declaration order [-Wreorder-ctor]. run(&nc c562l | S h m e mt.iwdo(rtki)d;) ,\ n t| h ^r eads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s )note: ,field 'nthreads' will be initialized after field 'tidInBlock' tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . b| u ^~~~~~~~~~~~~~~~~f fSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:s562[:N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'P ROTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t275h:r90e:a dnote: Iin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered x.x), 275g | r o u p ( g rPoruipm)i,t i v| e ^~~~~~~~~~~s , /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r owarning: uinitializer order does not match the declaration order [-Wreorder-ctor]p ), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hem.co:m562m:.15b:u fwarning: finitializer order does not match the declaration order [-Wreorder-ctor]S izes[NCCL_PROTO _562S | I M P L Et]i/dN(CtCiLd_)S,T EnPtSh/rseiazdeso(fn(tTh)r)e a{d s )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(groupI nBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:.324x:)90,: gnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up(group )324, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r imit i563v | e s < T ,s tReepdSOipz,e (FnacncAlsSyhmmmeemt.rciocm_,S I/M*PDLiEr]e/cNtC=C*L/_0S,T EPPrSo/tsoi,z e0o>f (pTr)i)m s{ | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :595275 | : 90 : note: rin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu nTree U275p | D o w n < T ,P rRiemdiOtpi,v ePsrs>y(mamregtsr)i;c < N| C ^C L_MAX_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:V202_:A53R:I Tnote: Yin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here, 1>, /202* | D i r e c t = * /R0u,n WPorroktEol,e m0e>n tp, ProtoSimple<1, 1>>' requested herer oto >595( | ) . r u nr(uwneT)r;e e U| p ^D own, 0, 2>::run' requested hereo toSi m7p | lIeML>L(_aFrUgNsC)(;A l l| R ^e duce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:R202E:E53,: Snote: Iin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereM PLE ,202 | S u m , uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :39115 | : warning: Rinitializer order does not match the declaration order [-Wreorder-ctor]u nWorkd,I nNBClCoLc_kA(LtGhOr_e#a#daIldgxo.,x )N,C CgLr_oPuRpO(TgOr_o#u#pp)r,o t o| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( ) .| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u n(&nc c563l | S h m e ms.tweoprSki)z;e (\n c c| l ^S hmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562b:u15f:f Snote: ifield 'nthreads' will be initialized after field 'tidInBlock'z es[NCC L562_ | P R O T Ot_iSdI(MtPiLdE)],/ NnCtChLr_eSaTdEsP(Sn/tshirzeeaodfs()T,) )t i{d I n| B ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l o c| k group(group( threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:.324x:)90,: gnote: rin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up(gr o324u | p ) , | ^~~~~~~~~~~~~~~~~P rimi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562v:e60s:< Tnote: ,field 'group' will be initialized after field 'stepSize' RedOp ,562 | F a n A styimdm(ettirdi)c,< 1n,t hNrCeCaLd_sM(AnXt_hDrEeVa_dAsR)I,T Yt>i,d I/n*BDliorck(threadIdx.x), group(group), | ^~~~~~~~~~~ ect=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :9562 | :I15M:P Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]C OLL_FUNC( A562l | l R e d utcied,( tTiRdE)E,, nStIhMrPeLaEd,s (Snutmh,r ueads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C15L:_ Swarning: Tinitializer order does not match the declaration order [-Wreorder-ctor]E PS/sizeo f562( | T ) ) {t i d| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ) group(group, nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d324s:)90,: tnote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered InBlo c324k | ( t h r e a dPIrdixm.ixt)i,v egsre,m ./c*oDmimr.ebcutf=f*S/i0z,e sP[rNoCtCoL,_ P0R>O TpOr_iSmIsM P L| E ^] /NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:S595T:E5P:S /note: sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herei zeof (595T | ) ) { r u| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T r e| e group(groupU pDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Op, P r275o | t o S i m p lPerv>e(sa, 0, 2>::run' requested hereN CCL_ M202A | X _ D E V _ A R IRTuYn,W o1r>k,E l/e*mDeinrte gpor,i mPsr o t| o ^> ().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:n595(:w5e:) ;note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here | ^ 595 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp runT:r11e:e1U:p Dnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested herew ne>,( aTrRgEsE),; S I| M ^P LE, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:,202 :f53l:o anote: tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here) | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : Rnote: uexpanded from macro 'IMPL_COLL_FUNC'n WorkE l391e | m e nRtu,( )F.urnucn#(#wdee)v;r e d| o ^p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp,: 13N:C1C:L _note: Ain instantiation of member function 'RunWork, 0, 2>::run' requested hereL GO_ #13# | aIlMgPoL,_ CNOCLCLL__FPURNOCT(OA_l#l#Rperdoutcoe>,( )T.RrEuEn,( &SnIcMcPlLSEh,m eSmu.mw,o rrkc)c;l _\b f l| o ^a t16) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391field 'nthreads' will be initialized after field 'tidInBlock': 95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | t391i | d ( tRiudn)W,o rnkt ,g rNoCuCpL(_gArLoGuOp_)#,# a l| g ^~~~~~~~~~~~~~~~~o , N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60P:R Onote: Tfield 'group' will be initialized after field 'stepSize'O _##p r562o | t o > ( )t.irdu(nt(i&dn)c,c lnSthhmreema.dwso(rnkt)h;r e\a d s| ) ^, ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( threa d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~n threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx906. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmex.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ d )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) nthrea d563s | ( n t h rsetaedpsS)i,z et(indcIcnlBSlhomcekm(.tchormema.dbIudfxf.Sxi)z,e sg[rNoCuCpL(_gPrRoOuTpO)_,S I M| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L E ]| / tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N CCL_STEPS/s i563z | e o f ( Ts)t)e p{S i z| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n c| c group(groupl Shmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:.275b:u90f:f Snote: iin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez es[NC C275L | _ P R O T O _PSrIiMmPiLtEi]v/eNsC, /*Direct=/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h*:/2750:,90 :P rnote: oin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret o, 0> p r275i | m s | ^ Primi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i595v:e5s:< Tnote: ,in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here Red O595p | , F a nrAusnyTmrmeeetUrpiDcol,e e>c(ta=r*g/s0),; P r| o ^t o, 0>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :p202r:i53m:s note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here| ^ 202 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:W595o:r5k:E lnote: ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herem ent <595F | n , T ,r uRneTdrOepe,U pADlogwon,< TP,r oRteod>O(p),. rPurno(twoeS)i;m p l| e ^< 1, 1>>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppa:r5g:s1):; note: in instantiation of member function 'RunWork, 0, 2>::run' requested here| ^ 5 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P202L:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P RtOiTdOI_n#B#lporcokt(ot>h(r)e.arduInd(x&.nxc)c,l Sghrmoeump.(wgorroku)p;) ,\ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562563: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock's tepSize (562n | c c l S htmiedm(.tciodm)m,. bnutfhfrSeiazdess([nNtChCrLe_aPdRsO)T,O _tSiIdMIPnLBEl]o/cNkC(CtLh_rSeTaEdPISd/xs.ixz)e,o fg(rTo)u)p ({g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ) ,| group(group | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :562324 | : 90 : note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d(tid), 324n | t h r e a d sP(rnitmhirteiavdess)<,T ,t iRdeIdnOBpl,o cFka(ntAhsryemamdeItdrxi.cx<)1,, gNrCoCuLp_(MgArXo_uDpE)V,_ A R| I ^~~~~~~~~~~T Y>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthreads(n t562h | r e a d st)i,d (ttiiddI)n,B lntohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~d Idx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 60g:r onote: ufield 'group' will be initialized after field 'stepSize'p (grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id) ,563 | n t h r esatdesp(Snitzher(enacdcsl)S,h mteimd.IcnoBmlmo.cbku(ftfhSriezaedsI[dNxC.CxL)_,P RgOrToOu_pS(IgMrPoLuEp])/,N C C| L ^~~~~~~~~~~_ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~n thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~o mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ k(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkt,i dN(CtCiLd_)A,L GnOt_h#r#eaaldgso(,n tNhCrCeLa_dPsR)O,T Ot_i#d#IpnrBoltooc>k(()t.hrruena(d&Indcxc.lxS)h,m egmr.owuopr(kg)r;o u\p ) ,| ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'563 | s562t | e p S i ztei(dn(ctcildS)h,m enmt.hcroemamd.sb(unftfhSriezaedss[)N,C CtLi_dPIRnOBTlOo_cSkI(MtPhLrEe]a/dNICdCxL._xS)T,E PgSr/osuipz(egorfo(uTp))), { | ^~~~~~~~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h group(group: 562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 275t:i90d:( tnote: iin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nth r275e | a d s ( n t hPrreiamdist)i,v etsi, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Simple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads (n391t | h r eRaudnsW)o,r kt, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60A:L Gnote: Ofield 'group' will be initialized after field 'stepSize'_ ##alg o562, | N C C Lt_iPdR(OtTiOd_)#,# pnrtohtroe>a(d)s.(rnutnh(r&enacdcsl)S,h mteimd.IwnoBrlko)c;k (\t h r| e ^a dIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pnote: )field 'nthreads' will be initialized after field 'tidInBlock', | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: E, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ zes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_mPROTO._wSoIrMkP)L;E ]\/ N C| C ^L _STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562{: 15 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: field 'nthreads' will be initialized after field 'tidInBlock' | group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t275i:d90(:t inote: din instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , nthr e275a | d s ( n t h rPeraidmsi)t,i vteisd60,: /note: *field 'group' will be initialized after field 'stepSize'D irect =562* | / 0 , Ptriodt(ot,i d0)>, pnrtihmrse a d| s ^( nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a595d:s5):, note: tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herei dInB l595o | c k ( t hrruenaTdrIedexU.pxD)o,w ng>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hNCCL_:P562R:O15T:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# proto>().run(& n562c | c l S h mteimd.(wtoirdk)),; n\t h r| e ^a ds(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,note: field 'nthreads' will be initialized after field 'tidInBlock't idIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Bloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . b| u ^~~~~~~~~~~~~~~~~f fSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562s:[60N:C Cnote: Lfield 'group' will be initialized after field 'stepSize'_ PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:.324x:)90,: gnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up(gr o324u | p ) , | ^~~~~~~~~~~P rimitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ GO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n(562w:e15):; warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7: 1562: | note: in instantiation of member function 'RunWork, 0, 2>::run' requested here ti d7( | tIiMdP)L,_ CnOtLhLr_eFaUdNsC((nAtlhlrReeaddusc)e,, tTiRdEIEn,B lSoIcMkP(LtEh,r eMaadxI,d xu.ixn)t,3 2g_rto)u p (| g^r oup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 : 95| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC'| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563 | R u n WsotrekpI,M PNLCEC]L/_NACLCGLO__S#T#EaPlSg/os,i zNeCoCfL(_TP)R)O T{O _ #| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p r o| t group(groupo >().run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l324S:h90m:e mnote: .in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herew ork); \324 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:r562i:m15i:t inote: vfield 'nthreads' will be initialized after field 'tidInBlock'e so,c k/(*tDhirreeacdtI=d*x/.0x,) ,P rgortoou,p (0g>r opurpi)m,s | | ^~~~~~~~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 595field 'group' will be initialized after field 'stepSize': 5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here562 | t595i | d ( t i dr)u,n TnrteherUepaDdosw(nnI>d(xa.rxg)s,) ;g r o| u ^p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' orkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]324 | Primitive s562< | T , R etdiOdp(,t iFda)n,A snytmhmreetardisc(t,h r/e*aDdiIrdexc.tx=)*,/ 0g,r oPurpo(tgor,o u0p>) ,p r i| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^ 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 595 : 5 :s tnote: ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herep Siz e595( | n c c l SrhumneTmr.eceoUmpmD.obwunfC>C(La_rSgTsE)P;S / s| i ^z eof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :{202 : 53| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here| group(group 202 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hW:o324r:k90E:l enote: min instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree nte(d)O.pr,u nF(awneA)s;y m m| e ^t ric<1,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp :N7C:C1L:_ Mnote: Ain instantiation of member function 'RunWork, 0, 2>::run' requested hereX _DE V7_ | AIRMIPTLY_>C,O L/L*_DFiUrNeCc(tA=l*l/R0e,d uPcreo,t oT,R E0E>, pSrIiMmPsL E ,| ^M ax, uin/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:35952:_5t:) note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here| ^ 595/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :r unote: nexpanded from macro 'IMPL_COLL_FUNC'T reeUpD o391w | n < TR,u nRWeodrOkp<,n cPcrloFtuoSinmcp#l#eft>y(paer,g sF)u;n c #| # ^d evredop/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h<:t202y:p53e:> ,note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereN CCL_ A202L | G O _ # # a l g oR,u nNWCoCrLk_EPlReOmTeOn_t#<#Fpnr,o tTo,> (R)e.drOupn,( &Anlcgcol,S hPmreomt.ow>o(r)k.)r;u n\( w e| ) ^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:: 7note: :field 'nthreads' will be initialized after field 'tidInBlock'1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 562 | 7 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eAaldlsR(endtuhcree,a dTsR)E,E ,t iSdIIMnPBLlEo,c kM(atxh,r euaidnItd3x2._xt)), g| r^o up(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:)95,: note: | expanded from macro 'IMPL_COLL_FUNC' ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562391: | 60 : Rnote: ufield 'group' will be initialized after field 'stepSize'n Work< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~t o>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o up(group), | 562 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p (grou p563) | , | ^~~~~~~~~~~s tepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:t562=:*15/:0 ,warning: initializer order does not match the declaration order [-Wreorder-ctor]P roto, 0> pri m562s | | ^ tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:)595,: 5n:t hnote: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heree ads( n595t | h r e a drsu)n,T rteiedUIpnDBolwonc)>,( a r| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s ) ;| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :s53t:e pnote: Sin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei ze(n c202c | l S h m e m . c oRmumn.WbourfkfESliezmeesn[tNP(S)/.sriuzne(owfe()T;) ) | { ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp group(group: 8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hM:P275L:_90C:O Lnote: Lin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ FUNC( A275l | l R e d u c eP,r iTmRiEtEi,v eSsI, /391* | D i rReucntW=o*r/k0<,n cPcrloFtuon,c #0#>f upnrci,m st y pe,| ^F unc##de/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hv:r595e:d5o:p , ProtoSimple<1, 1>>' requested herey pe>, 595N | C C L _ ALGO_##raulngTor,e eNUCpCDLo_wPnRr(o)t.orSuinm(p&lnece>m(.awrogrsk));; \| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::202562::5315:: note: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 202 | 562 | t iRdu(ntWiodr)k,E lnetmhernetah(r)e.arduInd(xw.ex));, g| r ^o up(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp):,9 : 1| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :960 | :I Mnote: Pfield 'group' will be initialized after field 'stepSize'L _COLL _562F | U N C ( AtlildR(etdiudc)e,, nTtRhErEe,a dSsI(MnPtLhEr,e aMdasx),, utiindtI6n4B_lto)c k (| t^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup( g391r | o u pR)u,n W o| r ^~~~~~~~~~~k , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nt tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hmem.w:o562r:k15):; warning: \initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), | ^~~~~~~~~~~~~~~~~563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e60p:S inote: zfield 'group' will be initialized after field 'stepSize'e (ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rgs); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562391: | 15 : Rwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]n Worki,d INnCBClLo_cAkL(GtOh_r#e#aadlIgdox,. xN)C,C Lg_rPoRuOpT(Og_r#o#uppr)o,t o >| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) . r| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n (&ncc l563S | h m e m .swtoerpkS)i;z e\( n c| c ^l Shme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562c:o15m:m .note: bfield 'nthreads' will be initialized after field 'tidInBlock'u ffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r324o:u90p:( gnote: rin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up), | 324 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60P:r inote: mfield 'group' will be initialized after field 'stepSize'i tives <562T | , R e dtOipd,( tFiadn)A,s ynmtmherteraidcs<(1n,t hNrCeCaLd_sM)A,X _tDiEdVI_nABRlIoTcYk>(,t h/r*eDaidrIedcxt.=x*)/,0 ,g rPoruopt(og,r o0u>p )p,r i m| s ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tric, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ EV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flagIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppS:I1M: PIn file included from L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:,10 : MIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hn:,167 : i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t5628:_15t:) warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391: 95562: | note: expanded from macro 'IMPL_COLL_FUNC' tid(ti d391) | , nRtuhnrWeoardks<(nnctchlrFeuandcs#)#,f utnicd,I ntBylpoec,k (Ftuhnrce#a#ddIedvxr.exd)o,p (,g rNoCuCpL)_,A L G| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ # #| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l go, N C563C | L _ P R OsTtOe_p#S#ipzreo(tnoc>c(l)S.hrmuenm(.&cnocmcml.SbhumfefmS.iwzoersk[)N;C C\L _ P| R ^O TO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15]:/ Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t275i:d90I:n Bnote: lin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo ck(th r275e | a d I d x . xP)r,i mgirtoiuvpe(sg, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives , /t*iDdi(rteicdt)=,* /n0t,h rPeraodtso(,n t0h>r epardism)s, t| i ^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:k595(:t5h:r enote: ain instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hered Idx. x595) | , g r oruupn(TgrreoeuUpp)D,o w n| < ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T , | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e dOp, P563r | o t o S ismtpelpeSc>c(laSrhgmse)m;. c o| m ^m .buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:s53[:N Cnote: Cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereL _PRO T202O | _ S I M P L E ] /RNuCnCWLo_rSkTEElPeSm/esnitz().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:(324w:e90):; note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp324: | 5 : 1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested hereP rim i5t | iIvMePsL<_TC,O LRLe_dFOUpN,C (FAalnlARseydmumceet,r iTcR_,t )/ * D| i^r ect=/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h*:/3910:,95 :P rnote: oexpanded from macro 'IMPL_COLL_FUNC't o, 0> 391p | r i mRsu n W| o ^r k, ProtoSimple<1, 1>>' requested herec , ty p595e | , F u nrcu#n#TdreeverUepdDoopwR,e dNOCpC,L _PArLoGtOo_S#i#mapllgeo<,1 ,N C1C>L>_(PaRrOgTsO)_;# # p| r ^o to>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202(:&53n:c cnote: lin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereS hmem .202w | o r k ) ; \ R| u ^n WorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:e562m:e15n:t r(e)a.drsu(nn(twher)e;a d s| ) ^, tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppo:c4k:(1t:h rnote: ein instantiation of member function 'RunWork, 0, 2>::run' requested herea dIdx. x4) | ,I MgPrLo_uCpO(LgLr_oFuUpN)C,( A l| l ^~~~~~~~~~~~~~~~~R ed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:c562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitivesi,d (/t*iDdi)r,e cntt=h*r/e0a,d sP(rnotthor,e a0d>s )p,r itmisd I n| B ^l ock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t595h:r5e:a dnote: Iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hered x.x )595, | g r o urpu(ngTrroeuepU)p,D o w| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~< T ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R edOp, P563r | o t o S ismtpelpeSc>c(laSrhgmse)m;. c o| m ^m .buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i202z:e53s:[ Nnote: Cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereC L_P R202O | T O _ S I M P L ER]u/nNWCoCrLk_ESlTeEmPeSn/ts().run(we); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :324:90/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:: 6note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 3246 | | I M P L _ CPOrLiLm_iFtUiNvCe(sA391,: 95/:* Dnote: iexpanded from macro 'IMPL_COLL_FUNC'r ect= *391/ | 0 , RPurnoWtoor,k <0n>c cplrFiumnsc # #| f ^u nc, type/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 595F:u5n:c #note: #in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hered evred o595p | < t y p er>u,n TNrCeCeLU_pADLoGwOn_<#T#,a lRgeod,O NCpC,L _PPrRoOtToOS_i#m#pplreo (1)>.>r(uanr(g&sn)c;c l S| h ^m em.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h\: 202 :| 53 ^: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 202note: | field 'nthreads' will be initialized after field 'tidInBlock' 562 | R u n Wtoirdk(Etliedm)e,n tno(c)k.(rtuhnr(ewaed)I;d x .| x ^) , group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppg:r5o:u1p:) ,note: in instantiation of member function 'RunWork, 0, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h5: | 562I:M60P:L _note: Cfield 'group' will be initialized after field 'stepSize'O LL_ F562U | N C ( A ltliRde(dtuicde),, TnRtEhEr,e aSdIsM(PnLtEh,r eMaidns,) ,u itnitd8I_ntB)l o c| k^( thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup(g r391o | u p )R,u n W| o ^~~~~~~~~~~r k, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreSads),i zteisd[INnCBClLo_cPkR(OtThOr_eSaIdMIPdLxE.]x/)N,C CgLr_oSuTpE(PgSr/osuipz)e,o f (| T ^~~~~~~~~~~) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| ^ :562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :warning: 562initializer order does not match the declaration order [-Wreorder-ctor]: 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)60 : note: field 'group' will be initialized after field 'stepSize' 563 | s562t | e p S i ztei(dn(ctcildS)h,m enmt.hcroemamd.sb(unftfhSriezaedss[)N,C CtLi_dPIRnOBTlOo_cSkI(MtPhLrEe]a/dNICdCxL._xS)T,E PgSr/osuipz(egorfo(uTp))), { | ^~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTree/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~15 : | warning: group(groupinitializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 324 :t90i:d (note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d), nt h324r | e a d s ( n tPhrriemaidtsi)v,e st, /* D563i | r e c t =s*t/e0p,S iPzreo(tnoc,c l0S>h mpermi.mcso m m| . ^b uffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hz:e595s:[5N:C Cnote: Lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here_ PROT O595_ | S I M P LrEu]n/TNrCeCeLU_pSDToEwPnS>(a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:g275s:)90;: note: | in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^ 275/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 202 : 53 : note: Pin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herer imit i202v | e s < T , R e dROupn,W oFraknEAlseymmemnett>,( )/.*rDuinr(ewcet)=;* / 0| , ^ Proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp,: 60:>1 :p rnote: iin instantiation of member function 'RunWork, 0, 2>::run' requested herem s | ^6 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:L595_:C5O:L Lnote: _in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereF UNC( A595l | l R e d urcuen,T rTeReEUEp,D oSwInM:>391(:a95r:g snote: )expanded from macro 'IMPL_COLL_FUNC'; | ^ 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202W:o53r:k , 0, 2>::run' requested herec clFu n202c | # # f u n c , tRyupneW,o rFkuEnlce#m#ednetvO,p ,N CAClLg_oA,L GPOr_o#t#oa>l(g)o.,r uNnC(CwLe_)P;R O T| O ^_ ##pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppt:o8>:(1):. rnote: uin instantiation of member function 'RunWork, 0, 2>::run' requested heren (&nc c8l | SIhMmPeLm_.CwOoLrLk_)F;U N\C ( A| l ^l Redu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:e562,: 15T:R Enote: Efield 'nthreads' will be initialized after field 'tidInBlock', SIMP L562E | , M i nt,i di(ntti6d4)_,t )n t h| r^e ads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC') , tidI n391B | l o cRku(ntWhorreka562, | N C C Lt_iAdL(GtOi_d#)#,a lngtoh,r eNaCdCsL(_nPtRhOrTeOa_d#s#)p,r ottiod>I(n)B.lroucnk((&tnhcrcelaSdhImdexm..xw)o,r kg)r;o u\p ( g| r ^o up)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 15 ^~~~~~~~~~~: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :T562R:E15E:, warning: Sinitializer order does not match the declaration order [-Wreorder-ctor]I MPLE, Min ,562 | i n t 6 4t_itd)( t i| d^) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s (note: nexpanded from macro 'IMPL_COLL_FUNC't hreads )391, | t iRduInnWBolrokc, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.heads(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor] s), tidInBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e15s:[ Nwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]C L_PROTO_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r324e:a90d:I dnote: xin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. x), g324r | o u p ( g r oPurpi)m,i t i| v ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e s <| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), RedOp ,563 | F a n A ssytmempeStirziec(e,s [/N*CDCiLr_ePcRtO=T*O/_0S,I MPPrLoEt]o/,N C0C>L _pSrTiEmPsS / s| i ^z eof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:)595): 5{: note: | in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group595 | runTre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:U324p:D90o:w nnote: , FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT , RedOp ,324 | P r o t o S iPmrpilmei<>T(,a rRgesd)O;p , | F ^a nAsymm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:t202r:i53c:< 1note: ,in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here NCC L202_ | M A X _ D E V _ ARRuInTWYo>r,k E/l*eDmiernetcl gpor,i mPsr o t| o ^> ().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:)595;: 5 :| ^note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp: 8595: | 1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested herer unT r8e | eIUMpPDLo_wCnOM>P(LaEr,g sM)i;n , | i ^n t64_t) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h^: 202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here: 95: note: expanded from macro 'IMPL_COLL_FUNC' 202 | 391 | R uRnuWnoWrokro(p)<.tryupne(>w,e )N;C C L| _ ^A LGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppg:o8,: 1N:C Cnote: Lin instantiation of member function 'RunWork, 0, 2>::run' requested here_ PROTO _8# | #IpMrPoLt_oC>O(L)L._rFUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562e:n15t:< Fwarning: ninitializer order does not match the declaration order [-Wreorder-ctor], T, Red O562p | , A l gtoi,d (Ptriodt)o,> (n)t.hrruena(dwse()n;t h r| e ^a ds), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppt:i9d:I1n:B lnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested herec k(t h9r | eIaMdPILd_xC.OxL)L,_ FgUrNoCu(pA(lglrRoeudpu)c,e , | T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R E E| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) SIMP L563E | , M i ns,t eupiSnitz6e4(_ntc)c l S| h^m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:m391m:.95b:u fnote: fexpanded from macro 'IMPL_COLL_FUNC'S izes[ N391C | C L _RPuRnOWToOr_kS, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:L275_:A90L:G Onote: _in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #algo ,275 | N C C L _ P RPOrTiOm_i#t#ipvreost,( )R.erduOnp(,& nFcacnlASshymmemme.twroirck<)N;C C\L _ M| A ^X _DEV_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:I562T:Y15,: 1note: >field 'nthreads' will be initialized after field 'tidInBlock', /*Di r562e | c t = * /t0i,d (Ptriodt)o,, n0t>h rperaidmss( n t| h ^r eads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t595i:d5I:n Bnote: lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereo ck( t595h | r e a d Irduxn.Txr)e,e UgprDoouwpn(> (562a | r g s ) ;t i d| ( ^t id), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: (in instantiation of member function 'RunWorkElement, 0, 2>::run' requested heren thre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h1::562 :note: 15in instantiation of member function 'RunWork, 0, 2>::run' requested here: warning: initializer order does not match the declaration order [-Wreorder-ctor] 9 | IMPL_COL L562_ | F U N C (tAildl(Rteiddu)c,e ,n tThRrEeEa,d sS(InMtPhLrEe,a dMsi)n,, tuiidnItn64B_lto)c k (| t^h readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391.:x95):, note: gexpanded from macro 'IMPL_COLL_FUNC'r oup(gr o391u | p ) ,R u n| W ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o r k| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n cclFu n563c | # # f u nsct,e ptSyipzee,( nFcucnlcS#h#mdeemv.rceodmomp.i,z eNsC[CNLC_CALL_GPOR_O#T#Oa_lSgIoM,P LNEC]C/LN_CPCRLO_TSOT_E#P#Sp/rsoitzoe>o(f)(.Tr)u)n ({& n c| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l S h| m group(groupe m.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 275 ^: 90: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 562:15: note: 275field 'nthreads' will be initialized after field 'tidInBlock' | 562P | r i m i ttiivde(stx,) ,/ *gDrioruepc(tg=r*o/u0p,) ,P r o| t ^~~~~~~~~~~~~~~~~o , 0>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :p562r:i60m:s note: field 'group' will be initialized after field 'stepSize'| ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 595 : 5 :t inote: din instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here( tid) ,595 | n t h r eraudnsT(rnetehUrpeDaodwsn)<,T ,t iRdeIdnOBpl,o cPkr(otthorSeiamdpIldex<.1x,) ,1 >g>r(oaurpg(sg)r;o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 562 | RtuindW(otrikdE)l,e mnetnhtr((t)h.rreuand(Iwdex).;x ) ,| ^g roup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppu:p9):,1 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'RunWork, 0, 2>::run' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 9 | IM P563L | _ C O L Ls_tFeUpNSCi(zAel(lnRcecdluSchem,e mT.RcEoEm,m .SbIuMfPfLSEi,z eMsi[nN,C CuLi_nPtR6O4T_Ot_)S I M| P^L E]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391S:T95E:P Snote: /expanded from macro 'IMPL_COLL_FUNC's izeof( T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h<:T562,: 15R:e dwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]p , ProtoSi m562p | l e < 1 ,t i1d>(>t(iadr)g,s )n;t h r| e ^a ds(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202):,53 :t inote: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereI nBlo c202k | ( t h r e a d I dRxu.nxW)o,r kgErloeumpe(ngtrt(e)p.Sriuzne((wnec)c;l S h| m ^e m.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppu:f9f:S1i:z enote: sin instantiation of member function 'RunWork, 0, 2>::run' requested here[ NCCL _9P | RIOMTPOL__SCIOMLPLL_EF]U/NNCC(CALl_lSRTeEdPuSc/es,i zTeRoEfE(,T )S)I M{P L E| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ M i| n group(group, uint64_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h): 275 :| 90^: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391: 95275: | note: expanded from macro 'IMPL_COLL_FUNC' Primi t391i | v e sRp,e >/,* DNiCrCeLc_tA=L*G/O0_,# #Parlogtoo,, N0C>C Lp_rPiRmOsT O _| # ^# proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:)595.:r5u:n (note: &in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heren cclS h595m | e m . w orrukn)T;r e\e U p| D ^o wnt>i(da(rtgisd));, n| t ^h reads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: )in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here, tid I202n | B l o c k ( t h rReuandWIodrxk.Exl)e,m egnrtofield 'group' will be initialized after field 'stepSize'( ).ru n562( | w e ) ; t i| d ^( tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppt:h9r:e1a:d snote: (in instantiation of member function 'RunWork, 0, 2>::run' requested heren thre a9d | sI)M,P Lt_iCdOILnLB_lFoUcNkC((tAhlrleRaeddIudcxe.,x )T,R EgEr,o uSpI(MgPrLoEu,p )M,i n ,| ^~~~~~~~~~~u int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_ALG:O562_:#15#:a lwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]o , NCCL_PROTO _562# | # p r o ttoi>d(()t.irdu)n,( &nntchcrleSahdmse(mn.twhorreka)d;s )\, t| i ^d InBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'd x.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~~~~~~~_ SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:]60/:N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _STEP S562/ | s i z e otfi(dT()t)i d{) , | n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:n275B:l90o:c knote: (in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreadId x275. | x ) , g r oPurpi(mgirtoiuvpe)s,< T ,| ^~~~~~~~~~~R edOp, FanAsymmetric, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ imple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitiveswarning: ,initializer order does not match the declaration order [-Wreorder-ctor] /*Direct= *562/ | 0 , P rtoitdo(,t i0d>) ,p rnitmhsr e a| d ^s (nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d595s:)5,: tnote: iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested hered InBl o595c | k ( t h rreuandTIrdexe.Uxp)D,o wgnr > ( a rsgtse)p;S i z| e ^( ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:.202c:o53m:m .note: bin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereu ffSi z202e | s [ N C C L _ P RROuTnOW_oSrIkMEPlLeEm]e/nNtC (| ) group(group. run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h;: 275 :| 90 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp :27510 | : 1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested here Prim i10t | iIvMePsL<_TC,O LRLe_dFOUpN,C (FAalnlARseydmumceet,r iTcR , | /^* Direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:=391*:/950:, note: Pexpanded from macro 'IMPL_COLL_FUNC'r oto, 0 >391 | p r iRmusn W o| r ^k , ProtoSimple<1, 1>>' requested herec , t y595p | e , F urnucn#T#rdeeevUrpeDdoowpn<e,d ONpC,C LP_rAoLtGoOS_i#m#pallegC>L(_aPrRgOsT)O;_ # #| p ^r oto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:u202n:(53&:n cnote: cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herel Shme m202. | w o r k ) ; \ R u| n ^W orkElem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:n562t:<15F:n ,note: field 'nthreads' will be initialized after field 'tidInBlock'T , RedO p562, | A l g ot,i dP(rtoitdo)>,( )n.trhurne(awdes)(;n t h| r ^e ads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppt:i11d:I1n:B lnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested herec k(t h11r | eIaMdPILd_xC.OxL)L,_ FgUrNoCu(pA(lglrRoeudpu)c,e , | T ^~~~~~~~~~~~~~~~~R EE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P60L:E ,note: field 'group' will be initialized after field 'stepSize'M in, f l562o | a t ) t| i^d (tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :n95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a ds(nt h391r | e a dRsu)n,W otrikd, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p(group )562, | | ^~~~~~~~~~~~~~~~~ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)60,: nnote: tfield 'group' will be initialized after field 'stepSize'h reads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), g563r | o u p ( gsrtoeuppS)i,z e (| n ^~~~~~~~~~~c clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBloc k13( | tIhMrPeLa_dCIOdLxL._xF)U,N Cg(rAolulpR(egdruocuep,) ,T R E| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, S| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PLE, M563i | n , r csctle_pbSfilzoea(tn1c6c)l S h| m^e m.comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:S391i:z95e:s [note: Nexpanded from macro 'IMPL_COLL_FUNC'C CL_PROT O391_ | S I MRPuLnEW]o/rNkC:,90 :N Cnote: Cin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL _ALGO _275# | # a l g o , PNrCiCmLi_tPiRvOeTsO<_T#,# pRreodtOop>,( )F.arnuAns(y&mnmcectlrSihcm, //usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h*:D562i:r15e:c tnote: =field 'nthreads' will be initialized after field 'tidInBlock'* /0, P r562o | t o , 0t>i dp(rtiimds) , | n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:(595n:t5h:r enote: ain instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hered s), t595i | d I n B lroucnkT(rteherUepaDdoIwdnx<.Tx,) ,R egdrOopu,p (PgrrootuopS)i,m p l| e ^~~~~~~~~~~~~~~~~< 1, 1>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:(562a:r60g:s )note: ;field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :t53i:d (note: tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei d), n202t | h r e a d s ( n tRhurneWaodrsk)E,l etmiednItng(r)o.urpu)n,( w e| ) ^~~~~~~~~~~; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T15E:P Swarning: /initializer order does not match the declaration order [-Wreorder-ctor]s izeof(T)) { 562| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r324e:a90d:s (note: nin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hread s324) | , t i d I nPBrliomcikt(itvhersep,S i/z*eD(inrceccltS=h*m/e0m,. cPormomt.ob,u f0f>S ipzreism[sN C C| L ^_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:_595S:I5M:P Lnote: Ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here] /NCC L595_ | S T E P Sr/usniTzreeoefU(pTD)o)w n{< T ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R e d| O group(groupp , ProtoSimple<1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 3241:>90>:( anote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg s); | 324 ^ | Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:m202i:t53i:v enote: sin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here< T, Re d202O | p , F a n A s yRmumneWtorrikcEl,g o/,* DPirroetcot>=(*)/.0r,u nP(rwoet)o; ,| ^0 > prims /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp :| 13 ^: 1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :595:5 :13 | note: Iin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereM PL_ C595O | L L _ F UrNuCn(TArleleRUepdDuocwen,< TT,R EREe,d OSpI,M PPLrEo,t oMSiinm,p lrecl>o(aatr1g6s)) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202expanded from macro 'IMPL_COLL_FUNC': 53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 391 | R202u | n W o r k < n c cRluFnuWnocr#k#Efluenmce,n tto,> (N)C.CrLu_nA(LwGeO)_;# # a| l ^g o, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp_:P13R:O1T:O _note: #in instantiation of member function 'RunWork, 0, 2>::run' requested here# proto >13( | )I.MrPuLn_(C&OnLcLc_lFSUhNmCe(mA.lwloRrekd)u;c e\, T| R ^E E, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15,: Mnote: ifield 'nthreads' will be initialized after field 'tidInBlock'n , rcc l562_ | b f l o atti1d6()t i d| )^, nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95(:n tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eads), 391t | i d IRnuBnlWoocrkk(field 'group' will be initialized after field 'stepSize', NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolck(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx906. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.huin:t3866:49_:t *warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable]p tr = r e386c | v P t r (i0n)t+ lwli1r2e8OOffffsseett ;= W| i ^~~r eWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~562 :15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :warning: 562initializer order does not match the declaration order [-Wreorder-ctor]: 60: note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i dt(itdi(dt)i,d nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), g), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :\562 : 15| : ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :562 | note: field 'nthreads' will be initialized after field 'tidInBlock' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , | ^~~~~~~~~~~~~~~~~ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t60e:p Snote: ifield 'group' will be initialized after field 'stepSize'z e(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize( { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flagIn file included from 2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp;: 1 : | In file included from ^~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hIn file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h153::16835: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :warning: 153unused variable 'flag2' [-Wunused-variable]: 14: warning: unused variable 'data1' [-Wunused-variable]153 | u i153n | t 3 2 _ tu idnatt3a21_,t fdlaatga11,, dfaltaag21,, fdlaatga22;, f| l ^~~~~a g2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recIn file included from v/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cppP:t1r(: 0In file included from )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h+:l10l: 1In file included from 2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h8:O169f: fs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.het:;271 : 19| : ^~~ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(In file included from n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppc:c1l: SIn file included from h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:e10m: .In file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ho:m169m: ./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hb:u271f:f19S:i zwarning: eunused variable 'ptr' [-Wunused-variable]s [NCCL_P R271O | T O _ S I M P L Eu]i/nNtC6C4L__tS*T EpPtSr/ s=i zreeocfv(PTt)r)( 0{) + l| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~1 2 8| O group(groupf fset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 562 | tRiudn(Wtoirdk)E,l enmtehnrtet(h)r.eraudnI(dwxe.)x;) , | g ^r oup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppu:p4):,1 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'RunWork, 0, 2>::run' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 4 | I M563P | L _ C O LsLt_eFpUSNiCz(eA(lnlcRceldSuhcmee,m .TcRoEmEm,. bSuIfMfPSLiEz,e sP[rNeCMCuLl_SPuRmO,T Oi_nStI8M_PtL)E ] /| N^C CL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:P391S:/95s:i znote: eexpanded from macro 'IMPL_COLL_FUNC'o f(T)) {391 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R u n| W group(groupo rk, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, type, 275F | u n c # # d ePvrriemdiotpi,, RNeCdCOLp_,A LFGaOn_A#s#yamlmgeot,r iNcCT(Y),. r1u>n,( &/n*cDcilrSehcmte=m*./w0o,r kP)r;o t\o , | 0 ^> prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 562note: | in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here t595i | d ( t i dr)u,n TnrteherUepaDdosw(nnI>d(xa.rxg)s,) ;g r o| u ^p (group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^~~~~~~~~~~~~~~~~53 : note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60 :202 | note: field 'group' will be initialized after field 'stepSize' 562R | u n W o rtkiEdl(etmiedn)t,< Fnnt,h rTe,a dRse(dnOtph,r eAaldgso),, PtriodtIon>B(l)o.crku(nt(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, :S562I:M15P:L Ewarning: ,initializer order does not match the declaration order [-Wreorder-ctor] PreMulSum, i562n | t 8 _ t )t i d| (^t id),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n391t:h95r: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: warning: | initializer order does not match the declaration order [-Wreorder-ctor] group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 275 : 90t:i dnote: (in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id), n275t | h r e a d s (Pnrtihmrietaidvse)s,< Tt,i dRIendBOlpo,c kF(atnhArseyamdmIedtxr.ixc)<,N CgCrLo_uMpA(Xg_rDoEuVp_)A,R I T| Y ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, 1| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), /*Di r563e | c t = * /s0t,e pPSriozteo(,n c0c>l Sphrmiemms. c o| m ^m .buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hz:e595s:[5N:C Cnote: Lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here_ PROT O595_ | S I M P LrEu]n/TNrCeCeLU_pSDToEwPnS>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:a275r:g90s:) ;note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53P:r inote: min instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei tive s202< | T , R e d O p ,R uFnaWnoArskyEmlmeemternitc<o,t o/>*(D)i.rreucnt(=w*e/)0;, P| r ^o to, 0>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp :p7r:i1m:s note: in instantiation of member function 'RunWork, 0, 2>::run' requested here| ^ 7 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:M595P:L5_:C Onote: Lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL _FU N595C | ( A l l RreudnuTcree,e UTpRDEoEw,n )> ( a| r^g s); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: 391in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here | Ru n202W | o r k < n c c l FRuunncW#o#rfkuEnlce,m etnytpe, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]R unWorkEl e562m | e n t < Ftni,d (Tt,i dR)e,d Onpt,h rAelagdos,( nPtrhorteoa>d(s)).,r utni(dwIen)B;l o c| k ^( threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppd:I6d:x1.:x )note: ,in instantiation of member function 'RunWork, 0, 2>::run' requested here gro u6p | (IgMrPoLu_pC)O,L L _| F ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~U N C| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)A llRedu c563e | , T R EsEt,e pSSIiMzPeL(En,c cPlrSehMmuelmS.ucmo,m mi.nbtu3f2f_Sti)z e s| [^N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:R391O:T95O:_ Snote: Iexpanded from macro 'IMPL_COLL_FUNC'M PLE]/ N391C | C L _RSuTnEWPoSr/ks, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep 324, | N C C L _ APLrGiOm_i#t#iavlegso<,T ,N CRCeLd_OPpR,O TFOa_n#A#spyrmomteot>r(i)c.\, /| * ^D irect=*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:0562,: 15P:r onote: tfield 'nthreads' will be initialized after field 'tidInBlock'o , 0> p562r | i m s t| i ^d (tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n595t:h5r:e anote: din instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heres (nth r595e | a d s ) ,r utniTdrIeneBUlpoDcokw(nt >| ( ^~~~~~~~~~~~~~~~~a rgs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):;562 : 60| : ^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :562202 | : 53 : note: tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei d(ti d202) | , n t h r e a dRsu(nnWtohrrkeEaldesm)e,n tto(u)p.(rgurno(uwpe)),; | | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]) , nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~d Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primit/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:v562e:s15<:T ,warning: initializer order does not match the declaration order [-Wreorder-ctor]R edOp, Fa n562A | s y m m ettirdi(ct),, /t*iDdiIrneBclto=c*k/(0t,h rPeraodtIod,x .0x>) ,p rgirmosu p (| g ^r oup),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 595| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~5 : | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595563 | | rsutneTprSeiezUep(DnocwcnlR>O(TaOr_gSsI)M;P L E| ] ^/ NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:E202P:S53/:s inote: zin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heree of( T202) | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn WorkElement, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree dOp, A l275g | o , P r o tPor>i(m)i.triuvne(sw, 0, 2>::run' requested herem etri c9< | NICMCPLL__MCAOXL_LD_EFVU_NACR(IATlYl,R e1d>u,c e/,* DTiRrEeEc,t =S*I/M0P,L EP,r oPtroe,M u0l>S upmr,i musi n t| 6 ^4 _t) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :595:5:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here: 95: note: expanded from macro 'IMPL_COLL_FUNC'595 | r391u | n T rReuenUWpoDrokwd>e(varregdso)p;< t y| p ^e >, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:A202L:G53O:_ #note: #in instantiation of member function 'RunWorkElement, 0, 2>::run' requested herea lgo, 202N | C C L _ P R O T OR_u#n#WporroktEol>e(m)e.nrtu().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562(:w15e:) ;note: field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp :t7i:d1(:t inote: din instantiation of member function 'RunWork, 0, 2>::run' requested here) , nt h7r | eIaMdPsL(_nCtOhLrLe_aFdUsN)C,( AtlildRIendBulcoec,k (TtRhErEe,a dSIIdMxP.LxE),, PgrreoMuupl(Sgurmo,u pu)i,n t 3| 2 ^~~~~~~~~~~~~~~~~_ t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :field 'group' will be initialized after field 'stepSize'391 :95: note: expanded from macro 'IMPL_COLL_FUNC'562 | t391i | d ( tRiudn)W,o rnktg,r oNuCpC(Lg_rAoLuGpO)_,# # a| l ^~~~~~~~~~~g o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] group(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15a:l gwarning: oinitializer order does not match the declaration order [-Wreorder-ctor], NCCL_PROT O562_ | # # p r ottiod>((t)id), nthreads(nt.hrruena(d&sn)c,c ltSihdmIenmB.lwoocrkk()t;h r\e a d| I ^d x.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:( gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up), | 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid( t563i | d ) , nsttherpeSaidzse((nntchcrleSahdmse)m,. ctoimdmI.nbBulfofcSki(ztehsr[eNaCdCILd_xP.RxO)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ^~~~~~~~~~~~~~~~~P S/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562o:f60(:T )note: )field 'group' will be initialized after field 'stepSize' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 | | group(group tid(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t324h:r90e:a dnote: sin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( nthre a324d | s ) , t i dPIrniBmliotcikv(etsh, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primit/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: vnote: ein instantiation of member function 'RunWorkElement, 0, 2>::run' requested heres o,, /P*rDoitroe>c(t)=.*r/u0n,( wPer)o;t o ,| ^0 > pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppm:s9 : 1| : ^ note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_595F:U5N:C (note: Ain instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herel lReduc e595, | T R E Er,u nSTIrMePeLUEp,D oPwrne391>:(95a:r gnote: sexpanded from macro 'IMPL_COLL_FUNC') ; | ^ 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202W:o53r:k , 0, 2>::run' requested herec clFu n202c | # # f u n c , tRyupneW,o rFkuEnlce#m#ednetvO,p ,N CAClLg_oA,L GPOr_o#t#oa>l(g)o.,r uNnC(CwLe_)P;R O T| O ^_ ##pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppt:o7>:(1):. rnote: uin instantiation of member function 'RunWork, 0, 2>::run' requested heren (&n c7c | lISMhPmLe_mC.OwLoLr_kF)U;N C\( A l| l ^R ed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:c562e:,15 :T Rnote: Efield 'nthreads' will be initialized after field 'tidInBlock'E , SIMP L562E | , P r etMiudl(Stuimd,) ,u inntth3r2e_atd)s ( n| t^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :t95i:d Inote: nexpanded from macro 'IMPL_COLL_FUNC'B lock(t h391r | e a dRIudnxW.oxr)k,< ngcrcoluFpu(ngcr#o#ufpu)n,c , | t ^~~~~~~~~~~~~~~~~y pe, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hF:u562n:c60#:# dnote: efield 'group' will be initialized after field 'stepSize'v redop< t562y | p e tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, NC:C562L:_15A:L Gwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ ##algo, NCCL_P R562O | T O _ # #tpirdo(ttoi>d()),. rnutnh(r&enacdcsl(Snhtmherme.awdosr)k,) ;t i\d I n| B ^l ock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(group )562, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( tid) ,563 | n t h r esatdesp(Snitzher(enacdcsl)S,h mteimd.IcnoBmlmo.cbku(ftfhSriezaedsI[dNxC.CxL)_,P RgOrToOu_pS(IgMrPoLuEp])/,N C C| L ^~~~~~~~~~~~~~~~~_ STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:/562s:i60z:e onote: ffield 'group' will be initialized after field 'stepSize'( T)) { 562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | t group(groupi d(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s275(:n90t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds), t275i | d I n B l o cPkr(itmhirteiavdeIsd, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hOL:L562_:F15U:N Cwarning: (initializer order does not match the declaration order [-Wreorder-ctor]A llReduce, TREE, S562I | M P L E ,t iPdr(etMiudl)S,u mn,t hurienatd6s4(_ntt)h r e| a^d s), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391I:n95B:l onote: cexpanded from macro 'IMPL_COLL_FUNC'k (threa d391I | d x .Rxu)n,W ogrrkoS,h mNeCmC.Lc_oAmLmG.Ob_u#f#faSligzoe,s [NNCCCCLL__PPRROOTTOO__#S#IpMrPoLtEo]>/(N)C.CrLu_nS(T&EnPcSc/lsSihzmeeomf.(wTo)r)k ){; \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562: | 275 : 90 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid) ,275 | n t h r e a dPsr(inmtihtrievaedss<)T,, tRieddIOnpB,l oFcakn(AtshyrmemaedtIrdixc., //usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h*:D562i:r60e:c tnote: =field 'group' will be initialized after field 'stepSize'* /0, P r562o | t o , 0t>i dp(rtiimds) , | n ^t hreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t595h:r5e:a dnote: sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here) , ti d595I | n B l o crku(ntThrreeeaUdpIDdoxw.nx<)T,, gRreoduOpp(,g rPoruopt)o,S i m| p ^~~~~~~~~~~l e<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~x ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563563 | | sstteeppSSiizzee((nnccccllSShhmmeemm..ccoommmm..bbuuffffSSiizzeess[[NNCCCCLL__PPRROOTTOO__SSIIMMPPLLEE]]//NNCCCCLL__SSTTEEPPSS//ssiizzeeooff((TT)))) {{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::324324::9090:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324324 | | PPrriimmiittiivveess<>,, //**DDiirreecctt==**//00,, PPrroottoo,, 00>> pprriimmss | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::595595::55:: note: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595595 | | rruunnTTrreeeeUUppDDoowwnn<>>>((aarrggss));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202202::5353:: note: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested herein instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202202 | | RRuunnWWoorrkkEElleemmeenntt<>(())..rruunn((wwee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp::98::11:: note: note: in instantiation of member function 'RunWork, 0, 2>::run' requested herein instantiation of member function 'RunWork, 0, 2>::run' requested here 98 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, TTRREEEE,, SSIIMMPPLLEE,, PPrreeMMuullSSuumm,, uiinntt6644__tt)) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<>,, NNCCCCLL__AALLGGOO__####aallggoo,, NNCCCCLL__PPRROOTTOO__####pprroottoo>>(())..rruunn((&&nnccccllSShhmmeemm..wwoorrkk));; \\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562563: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]s tepSize(n c562c | l S h m etmi.dc(otmimd.)b,u fnftShirzeeasd[sN(CnCtLhread_sP)R,O TtOi_dSIInMBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp)()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 324s:t90e:p Snote: iin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez e(nccl S324h | m e m . c o mPmr.ibmuiftfiSviezse| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ / *| D group(groupi rect=*/0,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :P275r:o90t:o ,note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here0 > prim s275 | | ^ Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:m595i:t5i:v enote: sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here< T, R e595d | O p , FraunnATsryememUeptDroiwcn<<,1 ,/ *1D>i>r(eacrtg=s*)/;0 , | P ^r oto, 0/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>: 202p:r53i:m snote: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 595 : 5 : note: Rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereu nWo r595k | E l e m ernutnS(i)m.prluen<(1w,e )1;> > (| a ^r gs); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp| : ^10 :1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :1053 | :I Mnote: Pin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereL _COL L202_ | F U N C ( A l l RReudnuWcoer,k ETlReEmEe,n tS^( ).run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:w391e:)95;: note: | expanded from macro 'IMPL_COLL_FUNC' ^ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp | : 13 :R1u:n Wnote: oin instantiation of member function 'RunWork, 0, 2>::run' requested herer kE,, NPCrCeLM_uAlLSGuOm_,# #raclcglo_,b fNlCoCaLt_1P6R)O T O| _^# #prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:>391(:)95.:r unote: nexpanded from macro 'IMPL_COLL_FUNC'( &ncclSh m391e | m . wRournkW)o;r k\< n c| c ^l Func##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:u562n:c15,: tnote: yfield 'nthreads' will be initialized after field 'tidInBlock'p e, Fun c562# | # d e v rteiddo(ptn,t hNrCeCaLd_sA(LnGtOh_r#e#aadlsg)o,, tNiCdCILn_BPlRoOcTkO(_t#h#rperadIdxo.txo)>,( )g.rrouunp((&gnrcoculpS)h,m e m| . ^~~~~~~~~~~~~~~~~w or/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:)562;: 60\: note: | field 'group' will be initialized after field 'stepSize' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~r oup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitivesr,e a/d*sD(inrtehcrte=a*d/s0),, PtriodtIon,B l0o>c kp(rtihmrse a d| I ^d x.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :g595r:o5u:p (note: gin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herer oup )595, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ r u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T reeUpD o563w | n < T , sRteedpOSpi,z eP(rnoctcolSSihmmpelme.b>u(fafrSgisz)e;s [ N| C ^C L_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T202O:_53S:I Mnote: Pin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereL E]/ N202C | C L _ S T E P S /RsuinzWeoorfk(ETl)e)m e{n t <| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n , | T group(group, RedOp, Algo, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:t324o:>90(:) .note: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(we); 324| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp :P13r:i1m:i tnote: iin instantiation of member function 'RunWork, 0, 2>::run' requested herev es< T13, | IRMePdLO_pC,O LFLa_nFAUsNyCm(mAeltlrRiecd, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(&ncc:l562S:h15m:e mwarning: .initializer order does not match the declaration order [-Wreorder-ctor]w ork); \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~p Siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:(562n:c60c:l Snote: hfield 'group' will be initialized after field 'stepSize'm em.com m562. | b u f f Stiizde(st[iNdC)C,L _nPtRhOrTeOa_dSsI(MnPtLhEr]e/aNdCsC)L,_ StTiEdPISn/Bsliozceko(ft(hTr)e)a d{I d x| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x ) ,| group(groupg roup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthr Proetoa>d(s)(.nrtuhnr(ewaed)s;) , | t ^i dInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppd:I13d:x1.:x )note: ,in instantiation of member function 'RunWork, 0, 2>::run' requested here grou p13( | gIrMoPuLp_)C,O L L| _ ^~~~~~~~~~~F UNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 0, 2>::run' requested heree, F unc##devre d13o | pIO,L LN_CFCULN_CA(LAGlOl_R#e#daulcgeo,, TNRCECEL,_ PSRIOMTPOL_E#,# pPrroetMou>l(S)u.mr,u nr(c&cnlc_cblfSlhomaetm1.6w)o r k| )^; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :expanded from macro 'IMPL_COLL_FUNC'15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 391 | 562 | R u n Wtoirdk(d,I dNxC.CxL)_,A LgGrOo_u#p#(aglrgoou,p )N,C C L| _ ^~~~~~~~~~~~~~~~~P RO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:O562_:#60#:p rnote: ofield 'group' will be initialized after field 'stepSize't o>() .562r | u n ( & ntcicdl(Sthimde)m,. wnotrhkr)e;a d\s ( n| t ^h reads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx906. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ :514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h10:: 514In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h9::168 : warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hvariable 'offset' set but not used [-Wunused-but-set-variable]: 153:14: warning: 514unused variable 'data1' [-Wunused-variable] | int offs e153t | = t iudi;n t 3| 2 ^_ t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx11017 warnings generated when compiling for gfx90a. . 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:o562m:m15.:b uwarning: finitializer order does not match the declaration order [-Wreorder-ctor]f Sizes[NC C562L | _ P R O TtOi_dS(ItMiPdL)E,] /nNtChCrLe_aSdTsE(PnSt/hsriezaedosf)(,T )t)i d{I n B| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o c k| ( group(groupt hreadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)68,: 56g:r onote: uin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herep (gro u68p | ) , | P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r i m| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t ives< T563, | R e d Ospt,e pFSainzSey(mnmcectlrSihcm.,c o0m,m .PbruoftfoS,i z0e>s [pNrCiCmLs_ P R| O ^T O_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:L588E:]5/:N Cnote: Cin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereL _ST E588P | S / s i zreuonfR(iTn)g)< T{, R| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d O p| , group(group Proto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:r68g:s56):; note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here| ^ 68 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :P53r:i mnote: iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heret ives <202T | , R e d O p , RFuannWSoyrmkmEeltermiecnF,n ,0 ,T ,P rRoetdoO,p ,0 >A lpgroi,m sP r o| t ^o >()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:u588n:(5w:e )note: ;in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here | ^ 588 | r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cppu:n5R:i1n:g , 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(argsT, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork15,: Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_ALGO_##a l562g | o , N CtCiLd_(PtRiOdT)O,_ #n#tphrroetaod>s(()n.trhurne(a&dnsc)c,l SthimdeImn.Bwloorckk)(;t h\r e a| d ^I dx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,note: field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 60 : note: tfield 'group' will be initialized after field 'stepSize'i d(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~o up(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60):, note: field 'group' will be initialized after field 'stepSize'| ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:dx562.:x15):, warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup(group), 562 | | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(60t:i dnote: )field 'group' will be initialized after field 'stepSize', nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~p Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h0:,562 :P15r:o twarning: oinitializer order does not match the declaration order [-Wreorder-ctor], 0> prim s562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d588):,5 :n tnote: hin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested herer ead s588( | n t h r eraudnsR)i,n gtI(daxr.gxs)),; g r| o ^u p(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:)202,: 53 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | sRtuenpWSoirzkeE(lnecmcelnSth, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, ha/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lf) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, | ^~~~~~~~~~~: 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset =/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562 :| 15^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'562 | t i391d | ( t iRdu)n,W onrtkhg,r oNuCpC(Lg_rAoLuGpO)_,# # a| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g o ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N CCL_P R563O | T O _ # #sptreoptSoi>z(e)(.nrcucnl(S&hnmcecml.Schommemm..bwuofrfkS)i;z e\s [ N| C ^C L_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:O562_:S15I:M Pnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'E ]/NCC L562_ | S T E P St/isdi(zteiodf)(,T )n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ( n| t group(grouph reads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i68d:I56n:B lnote: oin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herec k(th r68e | a d I d xP.rxi)m,i tgirvoeusp<(Tg,r oRuepd)O,p , | F ^~~~~~~~~~~~~~~~~a nSym/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562t:r60i:c , 0, P562r | o t o , t0i>d (ptriidm)s, n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n588t:h5r:e anote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heres ), t588i | d I n B lroucnkR(itnhgr((garrogusp));, | | ^ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:T562E:P15S:/ swarning: iinitializer order does not match the declaration order [-Wreorder-ctor]z eof(T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h68r:e56a:d snote: (in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested heren thre a68d | s ) , tPirdiImniBtliovceks(),, 0 ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P r o| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o , 0> p563r | i m s s| t ^e pSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n588c:c5l:S hnote: min instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heree m.c o588m | m . b u frfuSniRziensg[L(Ea]r/gNsC)C;L _ S| T ^EPS/s izeof(T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):)202 :{53 : | note: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | group(group 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 68 : 56R:u nnote: Win instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereo rkEl e68m | e n t < FPnr,i mTi,t iRveedsOy(m)m.erturni(cw;, 0| , ^ Proto,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp :011>: 1p:r inote: min instantiation of member function 'RunWork, 1, 2>::run' requested heres | ^ 11 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_588C:O5L:L _note: Fin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereU NC( A588l | l R e d urcuen,R iRnIgNf(laoragts)) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::202 :note: 53expanded from macro 'IMPL_COLL_FUNC': note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 391 | 202 | R u n W o r k < nRcucnlWFournkcE#l#efmuennct,< Ftny,p eT,, FRuendcO#p#,d eAvlrgeod,o pP>(,) .NrCuCnL(_wAeL)G;O _ #| # ^a lgo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cppC:L9_:P1R:O Tnote: Oin instantiation of member function 'RunWork, 1, 2>::run' requested here_ ##p r9o | tIoM>P(L)_.CrOuLnL(_&FnUcNcCl(SAhlmleRme.dwuocrek,) ;R I\N G ,| ^S IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:,562 :M15a:x ,note: field 'nthreads' will be initialized after field 'tidInBlock'u int64 _562t | ) | ^t id(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:)391,: 95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads(nt h391r | eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' # 562f | u n c , ttiydp(et,i dF)u,n cn#t#hdreevardesd(onpts,) ,N CtCiLd_IAnLBGlOo_c#k#(atlhgroe,a dNICdCxL._xP)R,O TgOr_o#u#pp(rgortoou>p()),. r u| n ^~~~~~~~~~~( &ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)In file included from , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp| : ^~~~~~~~~~~1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), oto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(All/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:es562[:N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]P ROTO_SIMPL E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d68I:d56x:. xnote: )in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here, gro u68p | ( g r o uPpr)i,m i t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~v e s| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T , Red O563p | , F a nsStyempmSeitzrei(cnl,S h0m,e mP.rcootmom,. b0u>f fpSriizmess [ N| C ^C L_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:I588M:P5L:E ]note: /in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereN CCL _588S | T E P S /rsuinzReionfg((args)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h;: 68 :| 56 ^: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :68202 | : 53 : note: Pin instantiation of member function 'RunWorkElement, 1, 2>::run' requested herer imit i202v | e s < T , R e dROupn,W oFraknESlyemmmeenttrT,, 0R,e dPOpr,o tAol,g o0,> Pprroitmos> ( )| . ^r un(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):;588 : 5| : ^ note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp | : 12 : 1 :r unote: nin instantiation of member function 'RunWork, 1, 2>::run' requested hereR ing< T12, | IRMePdLO_pC,O LPLr_oFtUoN>C((aArlglsR)e;d u c| e ^, RING, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I202M:P53L:E ,note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereP reMu l202S | u m , d o u b lReu)n W o| r^k Elemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:<391F:n95,: Tnote: ,expanded from macro 'IMPL_COLL_FUNC' RedOp, A l391g | o , RPurnoWtoor>k(<)n.crculnF(uwnec)#;# f u| n ^c , typ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cppe:,9 :F1u:n cnote: #in instantiation of member function 'RunWork, 1, 2>::run' requested here# dev r9e | dIoMpPL,_ FNUCNCCL(_AAlLlGROe_d#u#cael,g oR,I NNGC,C LS_IPMRPOLTEO,_ #P#rperMoutloS>u(m),. ruuinn(t&64n_ctc)l S h| m^e m.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k391):;95 :\ note: expanded from macro 'IMPL_COLL_FUNC'| ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :R562u:n15W:o rnote: kfield 'nthreads' will be initialized after field 'tidInBlock'< ncclFunc #562# | f u n c ,t itdy(ptei,d )F,u nnct#h#rdeeavdrse(dnotph),, NtCiCdLI_nABLlGoOc_k#(#tahlrgeoa,d INdCxC.Lx_)P,R OgTrOo_u#p#(pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~n (&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:l562S:h60m:e mnote: .field 'group' will be initialized after field 'stepSize'w ork); 562\ | | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)15,: nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~d Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 17 warnings/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp: generated10 when compiling for :gfx11021. : note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.ht: 386d:a9t:a 1warning: ,variable 'wireOffset' set but not used [-Wunused-but-set-variable] flag 1386, | d a t ai2n,t fwliarge2O;f f s| e ^~~~~t = Wi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hr:e153W:o28r:d Pwarning: eunused variable 'data2' [-Wunused-variable]r Slice *153w | a r p +u i2n*tw3i2d_;t d| a ^t a1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp,: 1N: CIn file included from C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.hL:_9P: RIn file included from O/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hT:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreadO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp271: | 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h : 9 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hu:i168n: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h6:4153_:t14*: pwarning: tunused variable 'data1' [-Wunused-variable]r = recvPtr (1530 | ) + l l 1u2i8nOtf3f2s_ett ;d a t| a ^~~1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ otoLL128>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:In file included from 35/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:: 1warning: : unused variable 'flag2' [-Wunused-variable]In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10 : 153In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 168 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hu:i153n:t143:2 _warning: tunused variable 'data1' [-Wunused-variable] data1, f153l | a g 1 , udiantta322,_ tf ldaagt2a;1 , | f ^~~~~l ag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, fl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.ha:g3861:,9 :d awarning: tvariable 'wireOffset' set but not used [-Wunused-but-set-variable]a 2, f386l | a g 2 ; i n| t ^~~~~ wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRi7n warninggs< generatedT when compiling for ,gfx1101 . RedOp, ProtoLL128>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloatIn file included from 1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp6): 1| : ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: :In file included from 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h::95169:: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hnote: :expanded from macro 'IMPL_COLL_FUNC'509 :29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]391 | RunWork, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hargs-:>562s:e15n:d bwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]f f, args- >562r | e c v b utfifd,( t i| d ^) , nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:(53n:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ads) ,202 | t i d I n B l o cRku(ntWhorrekaEdlIedmxe.nxt)<,F ng,r oTu,p (RgerdoOupp,) ,A l g| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, P| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o to>(). r563u | n ( w e )s;t e p| S ^i ze(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppS:h4m:e1m:. cnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested herem m.bu f4f | SIiMzPeLs_[CNOCLCLL__FPURNOCT(OA_lSlIRMePdLuEc]e/,N CCCOLL_LSNTEETP_SD/IsRiEzCeTo,f (STI)M)P L{E , | S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u m P| o group(groups tDiv, int8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| :^677 :11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 391:95: note: expanded from macro 'IMPL_COLL_FUNC'677 | 391 | pRruinmWso(rtkieo>u,t ,N CdCiLr_eAcLtG-O>_d#o#wanl,g oa,r gNsC-C>Ls_ePnRdObTuOf_f#,# parrogtso->>(r)e.crvubnu(f&fn,c c l| S ^h mem.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 :\53 : | note: ^in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' Run W562o | r k E l etmiedn(tti(d)I.nrBulno(cwke()t;h r e| a ^d Idx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp :g4r:o1u:p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup), 4 | | I ^~~~~~~~~~~~~~~~~M PL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:L562L:_60F:U Nnote: Cfield 'group' will be initialized after field 'stepSize'( AllRe d562u | c e , CtOiLdL(NtEiTd_)D,I RnEtChTr,e aSdIsM(PnLtEh,r eSaudmsP)o,s ttDiidvI,n Bilnotc8k_(tt)h r e| a^d Idx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC'( group) ,391 | | ^~~~~~~~~~~R unWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThred,u cNeC,C Ln_uAlLlGpOt_r#,# a&ldgior,e cNtC-C>Lo_uPtR,O TaOr_g#s#-p>rsoetnod>b(u)f.fr,u na(r&gnsc-c>lrSehcmvebmu.fwfo,r k )| ; ^ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15202: | note: field 'nthreads' will be initialized after field 'tidInBlock' Ru n562W | o r k E lteimde(nttid(I)n.Brluonc(kw(et)h;r e a| d ^I dx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppg:r4o:u1p:( gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up), 4 | | I ^~~~~~~~~~~~~~~~~M PL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:L562L:_60F:U Nnote: Cfield 'group' will be initialized after field 'stepSize'( AllRed u562c | e , C OtLiLdN(EtTi_dD)I,R EnCtTh,r eSaIdMsP(LnEt,h rSeuamdPso)s,t Dtiivd,I niBnlto8c_kt()t h r| e^a dIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:x391):,95 :g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p(grou p391) | , R| u ^~~~~~~~~~~n Work, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group((group),n t h| r ^~~~~~~~~~~e ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 562N | C C L _ AtLiGdO(_t#i#da)l,g on,t hNrCeCaLd_sP(RnOtThOr_e#a#dpsr)o,t ot>i(d)I.nrBulno(c&kn(ctchlrSehamdeImd.xw.oxr)k,) ;g r\o u p| ( ^g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 : 15| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock'| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t isdt(etpiSdi)z,e (nntchcrleSahdmse(mn.tchormema.dbsu)f,f StiizdeIsn[BNlCoCcLk_(PtRhOrTeOa_dSIIdMxP.LxE)],/ NgCrCoLu_pS(TgErPoSu/ps)i,z e o| f ^~~~~~~~~~~~~~~~~( T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562{: 60 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' | group(group 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626n:t9h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s(nthr e626a | d s ) , t i d IpnrBilmosc(kt(itdh-rteiaddSItdaxr.txS)c,a tgtreoru,p (ngTrhoruepa)d,s S c| a ^~~~~~~~~~~t ter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:e562m:e15n:t n(t)h.rreuand(sw(en)t;h r e| a ^d s), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppB:l5o:c1k:( tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eadIdx .5x | )I,M PgLr_oCuOpL(Lg_rFoUuNpC)(,A l l| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e d u| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e , C O563L | L N E T _sDtIeRpESCiTz,e (SnIcMcPlLSEh,m eSmu.mcPoomsmt.Dbiuvf,f Suiiznets8[_NtC)C L _| P^R OTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I391M:P95L:E ]note: /expanded from macro 'IMPL_COLL_FUNC'N CCL_S T391E | P S /RsuinzWeoorfk(, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herev redop <641t | y p e >, N C C L _ A LpGrOi_m#s#(atligdo-,t iNdCSCtLa_rPtRROeTdOu_c#e#,p rnoTthor>e(a)d.srRuend(u&cnec,c ldSihrmeecmt.-w>odrokw)n;, \& d i| r ^e ct->o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:t562,: 15a:r gnote: sfield 'nthreads' will be initialized after field 'tidInBlock'- >send b562u | f f , atrigds(-t>irde)c,v bnutfhfr,e a d| s ^( nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202):,53 :t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI nBlo c202k | ( t h r e a d I dRxu.nxW)o,r kgErloeumpe(ngtr (562) | . r u n (twied)(;t i d| ) ^, nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppa:d4s:(1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads) ,4 | tIiMdPILn_BClOoLcLk_(FtUhNrCe(aAdlIldRxe.dxu)c,e ,g rCoOuLpL(NgErTo_uDpI)R,E C T| , ^~~~~~~~~~~ SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e15p:S iwarning: zinitializer order does not match the declaration order [-Wreorder-ctor]e (ncclShmem. c562o | m m . b utfifdS(itzieds)[,N CnCtLh_rPeRaOdTsO(_nStIhMrPeLaEd]s/)N,C CtLi_dSITnEBPlSo/cski(ztehorfe(aTd)I)d x{. x )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ g r| o group(groupu p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p666):,9 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 666 | 563 | sptreipmSsi(ztei(dn,c cnlTShhrmeeamd.scGoamtmh.ebru,f fdSiirzeecst[-N>CuCpL,_ PNRUOLTLO,_ SaIrMgPsL-E>]s/eNnCdCbLu_fSfT,E PaSr/gssi-z>eroefc(vTb)u)f f{, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :202641 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Run W641o | r k E l e m e n t < Fpnr,i mTs,( tRiedd-Otpi,d SAtlagrot,R ePdruocteo,> (n)T.hrruena(dwseR)e;d u c| e ^, dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppc:t5-:>1d:o wnote: nin instantiation of member function 'RunWork, 2, 2>::run' requested here, &dir e5c | tI-M>PoLu_tC,O LaLr_gFsU-N>Cs(eAnldlbRuefdfu,c ea,r gCsO-L>LrNeEcTv_bDufIfR,E C T| , ^ SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:u202m:P53o:s tnote: Din instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei v, u202i | n t 8 _ t ) | R^u nWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:E391l:e95m:e nnote: texpanded from macro 'IMPL_COLL_FUNC'< Fn, T ,391 | R e dROupn,W oArlkg#(#)f.urnucn,( wtey)p;e , | F ^u nc##de/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppv:r5e:d1o:p , 2, 2>::run' requested herey pe>, 5N | CICMLP_LA_LCGOOL_L#_#FaUlNgCo(,A lNlCRCeLd_uPcReO,T OC_O#L#LpNrEoTt_oD>I(R)E.CrTu,n (S&InMcPcLlES,h mSeumm.PwoosrtkD)i;v ,\ u i| n ^t 8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hfield 'nthreads' will be initialized after field 'tidInBlock': 391:95: 562note: | expanded from macro 'IMPL_COLL_FUNC' ti d391( | t i dR)u,n Wnotrhkrr,o uNpC(CgLr_oAuLpG)O,_ # #| a ^~~~~~~~~~~~~~~~~l go,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C60L:_ Pnote: Rfield 'group' will be initialized after field 'stepSize'O TO_# #562p | r o t o >t(i)d.(rtuind()&,n cnctlhSrhemaedms.(wnotrhkr)e;a d\s ) ,| ^t idInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx.x) ,562 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t esptSeipzSei(znec(cnlcSchlmSehmm.ecmo.mcmo.mbmu.fbfuSfifzSeisz[eNsC[CNLC_CPLR_OPTROO_TSOI_MSPILMEP]L/EN]C/CNLC_CSLT_ESPTSE/PsSi/zseiozfe(oTf)()T ){) {| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 655:11: 666note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | p r i m s ( t i d , pnrTihmrse(atdisdG-attihdeSrt,a rdtiRreedcutc-e>,u pn,T hNrUeLaLd,s Raerdgusc-e>,s ennudlblupftfr,, a&rdgisr-e>crte-c>vobuutf,f ,a r g| s ^- >sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:,53 :a rnote: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ->r e202c | v b u f f , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthre | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60 :562 | note: field 'group' will be initialized after field 'stepSize' tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), | ^~~~~~~~~~~563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnBloc:k562(:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d Idx.x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~~~~~~~) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s (note: nfield 'group' will be initialized after field 'stepSize't hrea d562s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e adId x563. | x ) , gsrtoeuppS(igzreo(unpc)c,l S h| m ^~~~~~~~~~~e m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hIRECT:,562 :S15I:M Pwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]E , SumPostDi v562, | i n t 8t_itd)( t i| d^) , nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95s:( nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads) ,391 | t i dRIunnBWloorckk<(ntchcrleFaudnIcd#x#.fxu)n,c ,g rtoyuppe(,g rFouunpc)#,# d ev| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e d o| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)< type> ,563 | N C C L _sAtLeGpOS_i#z#ea(lngcoc,l SNhCmCeLm_.PcRoOmTmO._b#u#fpfrSoitzoe>s([)N.CrCuLn_(P&RnOcTcOl_SShImMePmL.Ew]o/rNkC)C;L _\S T E| P ^S /size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:f562(:T15):) note: {field 'nthreads' will be initialized after field 'tidInBlock' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 562 group(group | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e626a:d9s:( nnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh read s626) | , t i d I n B lporcikm(st(htrieda-dtIiddxS.txa)r,t Sgcraotutpe(rg,r onuTph)r,e a d| s ^~~~~~~~~~~~~~~~~S cat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562r:,60 :N Unote: Lfield 'group' will be initialized after field 'stepSize'L , di r562e | c t - > utpi,d (atrigds)-,> snetnhdrbeuafdfs,( natrhgrse-a>drse)c,v btuifdfI,n B l| o ^c k(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.202x:)53,: gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up(gr o202u | p ) , | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hOTO_##p:r562o:t15o:> (warning: ).initializer order does not match the declaration order [-Wreorder-ctor]r un(&ncclShmem.w o562r | k ) ; \t i d| ( ^t id), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: (field 'nthreads' will be initialized after field 'tidInBlock'n threa d562s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e adIdx .563x | ) , g rsotuepp(Sgirzoeu(pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:m562m:.60b:u fnote: ffield 'group' will be initialized after field 'stepSize'S izes[ N562C | C L _ P RtOiTdO(_tSiIdM)P,L En]t/hNrCeCaLd_sS(TnEtPhSr/esaidzse)o,f (tTi)d)I n{B l o| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ( t| h group(groupr eadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:(687g:r11o:u pnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^~~~~~~~~~~ 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: inote: in instantiation of member function 'RunWork, 2, 2>::run' requested here n t68 | _ItM)P L _| C^O LL_FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:l391R:e95d:u cnote: eexpanded from macro 'IMPL_COLL_FUNC', COLLNE T391_ | D I RREuCnTW,o rSkIexpanded from macro 'IMPL_COLL_FUNC', NCCL_ A391L | G O _R#u#naWlogrok,< nNcCcClLF_uPnRcO#T#Of_u#n#cp,r ottyop>e(,) .Fruunnc(#&#ndcecvlrSehdmoepm<.twyoprek>),; N\C C L| _ ^A LGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:o562,: 15N:C Cnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ PROTO _562# | # p r o ttoi>d(()t.irdu)n,( &nntchcrleSahdmse(mn.twhorreka)d;s )\, t| i ^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's ), ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~I nBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd Idx.x) ,562 | g r o u pt(igdr(otiudp)),, n t| h ^~~~~~~~~~~r eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs[:N562C:C15L:_ Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]O TO_SIMPL E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:(666t:h9r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI dx.x )666, | g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads G563a | t h e r ,s tdeiprSeiczte-(>nucpc,l SNhUmLeLm,. caormgms.-b>usfefnSdibzuefsf[,N CaCrLg_sP-R>OrTeOc_vSbIuMfPfL,E ] /| N ^C CL_STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:/202s:i53z:e onote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( T)) {202 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:l666e:m9e:n tnote: , FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF n, T, 666R | e d O p , A l gpor,i mPsr(ottiod>,( )n.Trhurne(awdes)G;a t h| e ^r , dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppc:t5-:>1u:p ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereN ULL ,5 | aIrMgPsL-_>CsOeLnLd_bFuUfNfC,( AalrlgRse-d>urceec,v bCuOfLfL,N E T| _ ^D IRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I202M:P53L:E ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS umPo s202t | D i v , u i n tR8u_ntW)o r k| E^l emen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:<391F:n95,: Tnote: ,expanded from macro 'IMPL_COLL_FUNC' RedOp, 391A | l g oR,u nPWroortko<>n(c)c.lrFuunn(cw#e#)f;u n c| , ^ type, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppF:u5n:c1#:# dnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herev redo p5< | tIMPLy_pCeO>L,L _NFCUCNLC_(AALlGlOR_e#d#uacleg,o ,C ONLCLCNLE_TP_RDOITROE_C#T#,p rSoItMoP>L(E),. rSuunm(P&onsctcDliSvh,m eumi.nwto8r_kt)); \| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::95562:: 15note: :expanded from macro 'IMPL_COLL_FUNC' note: field 'nthreads' will be initialized after field 'tidInBlock' 391 | 562 | R u n Wtoirdk(d,I dNxC.CxL)_,A LgGrOo_u#p#(aglrgoou,p )N,C C L| _ ^~~~~~~~~~~~~~~~~P ROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#60p:r onote: tfield 'group' will be initialized after field 'stepSize'o >(). r562u | n ( & n ctcildS(htmiedm).,w onrtkh)r;e a\d s (| n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'n Block( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hv:,562 :i15n:t 3warning: 2initializer order does not match the declaration order [-Wreorder-ctor]_ t) | ^ 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthread s391( | n t hRruenaWdosr)k,< ntcicdlIFnuBnlco#c#kf(utnhcr,e atdyIpdex,. xF)u,n cg#r#oduepv(rgerdooupp<)t,y p e| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_AL G563O | _ # # a lsgtoe,p SNiCzCeL(_nPcRcOlTSOh_m#e#mp.rcootmom>.(b)u.frfuSni(z&ensc[cNlCSChLm_ePmR.OwToOr_kS)I;M P\L E ]| / ^N CCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:i562z:e15o:f (note: Tfield 'nthreads' will be initialized after field 'tidInBlock') ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 | | group(group tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677n:t11h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s(nthr e677a | d s ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,B cgarsotu,p (ngTrhoruepa)d,s B c| a ^~~~~~~~~~~~~~~~~s t, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h&:d562i:r60e:c tnote: -field 'group' will be initialized after field 'stepSize'> out, 562d | i r e c tt-i>dd(otwind,) ,a rngtsh-r>esaednsd(bnutfhfr,e aadrsg)s,- >triedcIvnbBulfofc,k ( t| h ^r eadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x202.:x53):, note: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer oup( g202r | o u p ) , | ^~~~~~~~~~~R unWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hds), :t562i:d15I:n Bwarning: linitializer order does not match the declaration order [-Wreorder-ctor]o ck(threadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~d s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:R562O:T15O:_ #warning: #initializer order does not match the declaration order [-Wreorder-ctor]p roto>(). r562u | n ( & n ctcildS(htmiedm).,w onrtkh)r;e a\d s (| n ^t hreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i15d:I nnote: Bfield 'nthreads' will be initialized after field 'tidInBlock'l ock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ), ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~[ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O60T:O _note: Sfield 'group' will be initialized after field 'stepSize'I MPLE] /562N | C C L _ StTiEdP(St/isdi)z,e onft(hTr)e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:B687l:o11c:k (note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh readI d687x | . x ) , g r o u p (pgrriomusp()t,i d -| t ^~~~~~~~~~~i dStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'group' will be initialized after field 'stepSize' :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:i562z:e15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPos/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:D562i:v15,: iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t 32_t) | 562^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthreads (391n | t h rReuandWso)r,k ,563 | N C C L _sAtLeGpOS_i#z#ea(lngcoc,l SNhCmCeLm_.PcRoOmTmO._b#u#fpfrSoitzoe>s([)N.CrCuLn_(P&RnOcTcOl_SShImMePmL.Ew]o/rNkC)C;L _\S T E| P ^S /sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: note: | field 'nthreads' will be initialized after field 'tidInBlock' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i641d:(11t:i dnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nthr e641a | d s ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteRaeddIudcxe.,x )n,T hgrreoaudps(Rgerdouucpe),, d i| r ^~~~~~~~~~~~~~~~~e ct->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:o562w:n60,: ¬e: dfield 'group' will be initialized after field 'stepSize'i rect-> o562u | t , a rtgisd-(>tsiedn)d,b unftfh,r eaardgss(-n>trhercevabdusf)f,, t i| d ^I nBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t202h:r53e:a dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered x.x) ,202 | g r o u p ( g r oRuupn)W,o r k| E ^~~~~~~~~~~l ement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562S:t15a:r twarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]e duce, nThr e562a | d s R e dtuicde(,t indu)l,l pnttrh,r e&adirect->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:s (note: nfield 'nthreads' will be initialized after field 'tidInBlock't hreads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (threadI d563x | . x ) , sgtreopuSpi(zger(onucpc)l,S h m| e ^~~~~~~~~~~~~~~~~m .co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:m562.:b60u:f fnote: Sfield 'group' will be initialized after field 'stepSize'i zes[ N562C | C L _ P RtOiTdO(_tSiIdM)P,L En]t/hNrCeCaLd_sS(TnEtPhSr/esaidzse)o,f (tTi)d)I n{B l o| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ( t| h group(groupr eadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u655p:(11g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep ), | ^~~~~~~~~~~655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:(562w:e15):; warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6: 1562: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here ti d6( | tIiMdP)L,_ CnOtLhLr_eFaUdNsC((nAtlhlrReeaddusc)e,, tCiOdLILnNBElTo_cDkI(RtEhCrTe,a dSIIdMxP.LxE),, SgurmoPuops(tgDriovu,p )i,n t 3| 2 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ t )| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :s tnote: eexpanded from macro 'IMPL_COLL_FUNC'p Size(nc c391l | S h mReumn.Wcoormkm<.nbcucflfFSuinzce#s#[fNuCnCcL,_ PtRyOpTeO,_ SFIuMnPcL#E#]d/eNvCrCeLd_oSpTi,z eNoCfC(LT_)A)L G{O _ #| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a l g| o group(group, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:r687o:t11o:> (note: )in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. run(&n c687c | l S h m e m . w o r kp)r;i m\s ( t| i ^d -tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:t562a:r15t:B cnote: afield 'nthreads' will be initialized after field 'tidInBlock's t, n T562h | r e a d stBicda(stti,d )&,d inrtehcrte-a>dosu(tn,t hnruelaldpst)r,, tairdgIsn-B>lsoecnkd(btuhfrfe,a daIrdgxs.-x>)r,e cgvrbouufpf(,g r o| u ^p ), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::202562::5360:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herefield 'group' will be initialized after field 'stepSize' 202562 | | t i d (RtuindW)o,r knEtlhermeeandts<(Fnnt,h rTe,a dRse)d,O pt,i dAInlBgloo,c kP(rtohtroe>a(d)I.drxu.nx()w,e )g;r o u| p ^( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp| : ^~~~~~~~~~~6 :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C15L:_ Swarning: Tinitializer order does not match the declaration order [-Wreorder-ctor]E PS/sizeof( T562) | ) { t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ( t| i group(groupd ), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a666d:s9(:n tnote: hin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads) ,666 | t i d I n B l o cpkr(itmhsr(etaiddI,d xn.Txh)r,e agdrsoGuapt(hgerro,u pd)i,r e c| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~- > u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), NULL ,563 | a r g s -s>tseepnSdibzuef(fn,c calrSghsm-e>mr.eccovmbmu.fbfu,f f S| i ^z es[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS IMP L202E | ] / N C C L _ S TREuPnSW/osrikzEeloefm(eTn)t)< F{n , | T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, R| e group(groupd Op, Algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 655P:r11o:t onote: >in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( ).run (655w | e ) ; | ^ pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppi:m6s:(1t:i dnote: -in instantiation of member function 'RunWork, 2, 2>::run' requested heret idSt a6r | tIRMePdLu_cCeO,L Ln_TFhUrNeCa(dAslRleRdeudcuec,e ,n uClOlLpLtNrE,T _&DdIiRrEeCcTt,- >SoIuMtP,L Ea,r gSsu-m>PsoesntdDbiuvf,f ,i natr3g2s_-t>)r e c| v^b uff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :39153 | : note: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu nWor k202< | n c c l F u n c #R#ufnuWnocr,k Etlyepmee,n tF,, PNrCoCtLo_>A(L)G.Or_u#n#(awleg)o;, N| C ^C L_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp_:#7#:p1r:o tnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested here> ().run (7& | nIcMcPlLS_hCmOeLmL._wFoUrNkC)(;A l\l R e| d ^u ce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:O562L:L15N:E Tnote: _field 'nthreads' will be initialized after field 'tidInBlock'D IREC T562, | S I M PtLiEd,( tSiudm)P,o snttDhirve,a dusi(nntt3h2r_eta)d s )| ,^ tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l391o:c95k:( tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eadIdx. x391) | , gRruonuWpo(rgkrn,t hNrCeCaLd_sA(LnGtOh_r#e#aadlsg)o,, tNiCdCILn_BPlRoOcTkO(_t#h#rperaodtIod>x(.)x.)r,u ng(r&onucpc(lgSrhomuepm).,w o r| k ^~~~~~~~~~~) ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), nt h563r | e a d s (snttehprSeiazdes()n,c ctliSdhImneBml.occokm(mt.hbruefafdSIidzxe.sx[)N,C CgLr_oPuRpO(TgOr_oSuIpM)P,L E ]| / ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C C| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ STEPS /563s | i z e o fs(tTe)p)S i{z e (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c c l| S group(grouph mem.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hb:u677f:f11S:i znote: ein instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres [NCCL_ P677R | O T O _ S I M P L E ]p/rNiCmCsL(_tSiTdE-PtSi/dsSitzaerotfB(cTa)s)t ,{ n T| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups Bcast, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h&:d666i:r9e:c tnote: -in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here> out, 666d | i r e c t - > d opwrni,m sa(rtgisd-,> sneTnhdrbeuafdfs,G aatrhgesr-,> rdeicrvebcutf-f>,u p ,| ^NULL, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:e202n:d53b:u fnote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, arg s202- | > r e c v b u f fR,u n W| o ^r kElement, 2, 2>::run' requested hered Op, Al g202o | , P r o t o > (R)u.nrWuonr(kwEel)e;m e n| t ^< Fn, T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp,: 6R:e1d:O pnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here Algo ,6 | PIrMoPtLo_>C(O)L.Lr_uFnU(NwCe()A;l l R| e ^d uce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppL:N7E:T1_:D Inote: Rin instantiation of member function 'RunWork, 2, 2>::run' requested hereE CT, S I7M | PILMEP,L _SCuOmLPLo_sFtUDNiCv(,A lilnRte3d2u_cte), C| O^L LNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:_391D:I95R:E Cnote: Texpanded from macro 'IMPL_COLL_FUNC', SIMP L391E | , SRuumnPWoosrtkD , RNuCnCWLo_rAkLe(v)r.erduonp(<&tnycpcel>S,h mNeCmC.Lw_oArLkG)O;_ #\# a l| g ^o , N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15P:R Onote: Tfield 'nthreads' will be initialized after field 'tidInBlock'O _##p r562o | t o > ( )t.irdu(nt(i&dn)c,c lnSthhmreema.dwso(rnkt)h;r e\a d s| ) ^, tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadIdx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~e ad/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~l ock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d Inote: dfield 'group' will be initialized after field 'stepSize'x .x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork),, NnCtChLr_eAaLdGsO(_n#t#harlegaod,s )N,C CtLi_dPIRnOBTlOo_c#k#(ptrhorteoa>d(I)d.xr.uxn)(,& ngcrcoluSph(mgermo.uwpo)r,k ) ;| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~\ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p15S:i znote: efield 'nthreads' will be initialized after field 'tidInBlock'( ncclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :field 'group' will be initialized after field 'stepSize'626 :9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 | t i626d | ( t i d ) , n tphrriemasd(st(indt-htriedaSdtsa)r,t StciadtItneBrl,o cnkT(htrheraedasdSIcdaxt.txe)r,, gNrUoLuLp,( gdrioruepc)t,- > u| p ^~~~~~~~~~~, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 2_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m15.:w owarning: rinitializer order does not match the declaration order [-Wreorder-ctor]k ); \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dnote: )field 'nthreads' will be initialized after field 'tidInBlock', nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g roup( g563r | o u p ) ,s t e| p ^~~~~~~~~~~~~~~~~S ize(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:c562c:l60S:h mnote: efield 'group' will be initialized after field 'stepSize'm .comm .562b | u f f S itzieds([tNiCdC)L,_ PnRtOhTrOe_aSdIsM(PnLtEh]r/eNaCdCsL)_,S TtEiPdSI/nsBilzoecokf((tTh)r)e a{d I d| x ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. x )| , group(group group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o626u:p9):, note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^~~~~~~~~~~ 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'group' will be initialized after field 'stepSize': 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhreads(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:,15 :a rwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]s ->recvbu f562f | , | ^t id(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds( n202t | h r e a d s ) , RtuindWIonrBklEolcekm(etnhtr| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) . r| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n (we); 563 | | ^ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppp:S7i:z1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested heree (ncc l7S | hImMePmL._cCoOmLmL._bFuUfNfCS(iAzlelsR[eNdCuCcLe_,P RCOOTLOL_NSEITM_PDLIER]E/CNTC,C LS_ISMTPELPES,/ sSiuzmePoofs(tTD)i)v ,{ u i| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :2| 562_ group(group:t 15): warning: | initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h^ : 655:11:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 562391in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | : 95 : note: 655texpanded from macro 'IMPL_COLL_FUNC' | i d ( t i d391 ) | , nR tuphnrrWieomarskd(s(r,g, r oN&uCdpCi)Lr,_e Ac Lt| G- ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O> _o #u| #t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a, l gao r,563g | sN -C >C sL e_snPtdRebOpuSTfiOfz_,e# (#anpcrrcgolstS-oh>>mr(ee)mc..vrcbuounmf(mf&.,nb cu cf| lf ^SS himzeems/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.[:w202No:Cr53Ck:L) _;note: P in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR\ O T O| 202_ ^ | S I M P L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h E: ] 562/ :NR15Cu:Cn LWnote: _ofield 'nthreads' will be initialized after field 'tidInBlock'Sr TkEEPl Se562/m | se in zt e< oFtfni(,dT ()Tt),i d{R) e, d | On ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pt ,h r| Ae group(grouplagdos,( nPtrhorteoa>d(s)).,r utni(dwIen)B;l o c| k ^( threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppd:x8.:x1):, note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup(g r8o | uIpM)P,L _ C| O ^~~~~~~~~~~~~~~~~L L_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hF:U562N:C60(:A lnote: lfield 'group' will be initialized after field 'stepSize'R educe ,562 | C O L L NtEiTd_(DtIiRdE)C,T ,n tShIrMePaLdEs,( nStuhmrPeoasdtsD)i,v ,t iidnItn6B4l_otc)k ( t| h^r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup( g391r | o u pR)u,n W o| r ^~~~~~~~~~~k , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr oup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 562: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]t id(tid), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p S i| z tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e (ncclS h563m | e m . c osmtme.pbSuifzfeS(inzcecsl[SNhCmCeLm_.PcRoOmTmO._bSuIfMfPSLiEz]e/sN[CNCCLC_LS_TPERPOST/Os_iSzIeMoPfL(ET])/)N C{C L _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T E P| S group(group/ sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:)666): 9{: note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 666 | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:i677m:s11(:t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nThread s677G | a t h e r , d i r epcrti-m>su(pt,i dN-UtLiLd,S taarrgtsB-c>assetn,d bnuTfhfr,e aadrsgBsc-a>srte,c v&bduifrfe,c t -| > ^o ut, d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:r202e:c53t:- >note: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo wn, a202r | g s - > s e n d bRuufnfW,o rakrEglse-m>ernetcnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) .run( w202e | ) ; | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppn:W8o:r1k:E lnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herem ent <8F | nI,M PTL,_ CROeLdLO_pF,U NACl(gAol,l RPerdoutcoe>,( )C.OrLuLnN(EwTe_)D;I R E| C ^T , SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp,: 7S:u1m:P onote: sin instantiation of member function 'RunWork, 2, 2>::run' requested heret Div, i7n | tI6M4P_Lt_)C O L| L^_ FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:(391A:l95l:R enote: dexpanded from macro 'IMPL_COLL_FUNC'u ce, C391O | L L NREuTn_WDoIrRkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391N:C95C:L _note: Aexpanded from macro 'IMPL_COLL_FUNC'L GO_##al g391o | , NRCuCnLW_oPrRkOf(u)n.cr,u nt(y&pnec,c lFSuhnmce#m#.dweovrrke)d;o p\< t y| p ^e >, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:A: 562Lnote: G:field 'nthreads' will be initialized after field 'tidInBlock'O15 _:# #warning: ainitializer order does not match the declaration order [-Wreorder-ctor]562l | g o , NtCi Cd562L( | _t Pi Rd O) T,tO i_nd#t(#htprireodat)d,os> (n(nt)th.hrrrueenaa(dd&ssn()cn,tc hltrSiehdamIdensmB).l,wo octrkik(d)tI;hn rB\el ao dc| Ik ^d( xt.hxr)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he,:a 562dg:Ir15do:xu .pnote: x(field 'nthreads' will be initialized after field 'tidInBlock')g ,r ogurp o)562u, | p ( g| r ^~~~~~~~~~~~~~~~~ o tuip/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd):(,562t :i 60d:| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~,note: field 'group' will be initialized after field 'stepSize' n | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h r e562a | d s 563( | n tt hi rd e(sattdiesdp))S,,i ztneit(dhnIrcnecBalldSoshc(mkne(tmth.hrcreoeamadmds.I)db,xu f.tfxiS)d,iI zngeBrslo[ouNcpCk(C(gLtr_hoPruRepOa)Td,OI _d Sx| I. ^~~~~~~~~~~~~~~~~Mx P)L,E /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h]g:/r562No:Cu60Cp:L( _gnote: Srfield 'group' will be initialized after field 'stepSize'To EuPpS)/, s 562i | z| e ^~~~~~~~~~~ o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r677e:a11d:s )note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidIn B677l | o c k ( t h r e a d Ipdrxi.mxs)(,t igdr-otuipd(SgtraorutpB)c,a s t| , ^~~~~~~~~~~ nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.halgo,: 562N:C15C:L _warning: Pinitializer order does not match the declaration order [-Wreorder-ctor]R OTO_# #562p | r o t o >t(i)d.(rtuind()&,n cnctlhSrhemaedms.(wnotrhkr)e;a d\s ) ,| ^t idInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s (nthre a563d | s ) , tsitdeIpnSBilzoec(kn(ctchlrSehamdeImd.xc.oxm)m,. bgurfofuSpi(zgerso[uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~O TO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E60]:/ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i687d:I11n:B lnote: oin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec k(thr e687a | d I d x . x ) , g rporuipm(sg(rtoiudp-)t,i d S| t ^~~~~~~~~~~a rtBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threabdIdxu.fxf),, g| r ^o up(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, nth:r562e:a15d:s (warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads), t i562d | I n B l otcikd((tthirde)a,d Indtxh.rxe)a,d sg(rnotuhpr(egardosu)p,) ,t i d| I ^~~~~~~~~~~n Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL:_562AL:G15O:_ #warning: #initializer order does not match the declaration order [-Wreorder-ctor]a lgo, NCCL_PROTO_ #562# | p r o t ot>i(d)(.triudn)(,& nnctchlrSehamdesm(.nwtohrrke)a;d s\) , | t ^i dInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s (nthre a563d | s ) , tsitdeIpnSBilzoec(kn(ctchlrSehamdeImd.xc.oxm)m,. bgurfofuSpi(zgerso[uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~O TO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L60E:] /note: Nfield 'group' will be initialized after field 'stepSize'C CL_ST E562P | S / s i zteiodf((tTi)d)) ,{ n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups (nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l655o:c11k:( tnote: hin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadId x655. | x ) , g r o u p ( gprroiumps)(,t i d| - ^~~~~~~~~~~t idStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWo r562k | E l e m etnitd<(Ftni,d )T,, nRtehdrOepa,d sA(lngtoh,r ePardost)o,> (t)i.drIunnB(lwoec)k;( t h| r ^e adIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppx:)7,: 1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep (gro u7p | )I,M P L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C O L| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ FUNC(A l563l | R e d u cset,e pCSOiLzLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,15 : | warning: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~initializer order does not match the declaration order [-Wreorder-ctor] | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563562 | | sttiedp(Stiizde)(,n cnctlhSrhemaedms.(cnotmhmr.ebaudfsf)S,i zteisd[INnCBClLo_cPkR(OtThOr_eSaIdMIPdLxE.]x/)N,C CgLr_oSuTpE(PgSr/osuipz)e,o f (| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ) | { tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 563| | group(group stepSize(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:c666c:l9S:h mnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem .comm. b666u | f f S i z e s [ NpCrCiLm_sP(RtOiTdO,_ SnITMhPrLeEa]d/sNGCaCtLh_eSrT,E PdSi/rseiczte-o>fu(pT,) )N U{L L ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a r g| s group(group- >sendbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:f626,: 9a:r gnote: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >recv b626u | f f , | ^ prims(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:-53t:i dnote: Sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret artS c202a | t t e r , n T hRruenaWdosrSkcEaltetmeern,t uApl,g oa,r gPsr-o>tsoe>n(d)b.urfufn,( waer)g;s - >| r ^e cvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp,: 8 :| 1 ^: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :8202 | :I53M:P Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC OLL_F U202N | C ( A l l R e d uRcuen,W oCrOkLELlNeEmTe_nDtIi(n)t.6r4u_nt()w e )| ;^ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppexpanded from macro 'IMPL_COLL_FUNC': 9:1: note: 391in instantiation of member function 'RunWork, 2, 2>::run' requested here | Ru n9W | oIrMkP, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ llReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562w:o15r:k )warning: ;initializer order does not match the declaration order [-Wreorder-ctor] \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dnote: )field 'nthreads' will be initialized after field 'tidInBlock', nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. x )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) group (563g | r o u p )s,t e p| S ^~~~~~~~~~~~~~~~~i ze(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S60h:m enote: mfield 'group' will be initialized after field 'stepSize'. comm. b562u | f f S i zteisd[(NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt]h/rNeCaCdLs_)S,T EtPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p666):,9 : | note: ^~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:r562,: 15a:r gwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]- >sendbuf f562, | a r g st-i>dr(etcivdb)u,f fn,t h r| e ^a ds(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:s53):, note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei dInB l202o | c k ( t h r e a dRIudnxW.oxr)k,E lgermoeunpt(s(t)e.prSuinz(ew(en)c;c l S| h ^m em.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppm:m8.:b1u:f fnote: Sin instantiation of member function 'RunWork, 2, 2>::run' requested herei zes[ N8C | CILM_PPLR_OCTOOL_LS_IFMUPNLCE(]A/lNlCRCeLd_uScTeE,P SC/OsLiLzNeEoTf_(DTI)R)E C{T , | S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I M P| L group(groupE , SumPos/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:D641i:v11,: inote: nin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret 64_t) 641| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95p:r inote: mexpanded from macro 'IMPL_COLL_FUNC's (tid-t i391d | S t aRrutnRWeodrukc#d#odwenv,r e&ddoipr>o,u tN,C CaLr_gAsL-G>Os_e#n#dablugfof,, NaCrCgLs_-P>RrOeTcOv_b#u#fpfr,o t o| > ^( ).run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c202c:l53S:h mnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem .wor k202) | ; \ | ^ Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hW:o562r:k15E:l enote: mfield 'nthreads' will be initialized after field 'tidInBlock'e ntd(s)(.nrtuhnr(ewaed)s;) , | t ^i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppo:c9k:(1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea dId x9. | xI)M,P Lg_rCoOuLpL(_gFrUoNuCp()A,l l R| e ^~~~~~~~~~~~~~~~~d uce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562C:O60L:L Nnote: Efield 'group' will be initialized after field 'stepSize'T _DIRE C562T | , S I MtPiLdE(,t iSdu)m,P onstthDrieva,d su(inntth6r4e_atd)s ) ,| ^t idI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B391l:o95c:k (note: texpanded from macro 'IMPL_COLL_FUNC'h readI d391x | . x )R,u ngWroorukp<(ngcrcoluFpu)n,c # #| f ^~~~~~~~~~~u nc, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. x )| , ^ group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp(:g9r:o1u:p )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 9 | | I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PL_C O563L | L _ F U NsCt(eAplSliRzeed(unccec,l SChOmLeLmN.EcTo_mDmI.RbEuCfTf,S iSzIeMsP[LNEC,C LS_uPmRPOoTsOt_DSiIvM,P LuEi]n/tN6C4C_Lt_)S T E| P^S /siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:o391f:(95T:) )note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx908. 43 warnings generated when compiling for gfx940. 43 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 43 warnings generated when compiling for gfx90a. 43 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for host. 43 warnings generated when compiling for gfx906. 43 warnings generated when compiling for gfx900. 43 warnings generated when compiling for gfx803. 43 warnings generated when compiling for gfx1102. 43 warnings generated when compiling for gfx1100. 43 warnings generated when compiling for gfx1101. 43 warnings generated when compiling for gfx1030. 43 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireIn file included from W/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cppo:r1d: PIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:S10l: iIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.he:*169w: a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hr:p271 :+19 :2 *warning: wunused variable 'ptr' [-Wunused-variable]i d; | ^ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | o>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] i d562, | g r o utpiNdt(htrieda)d,s ,n t&hrreecavd,s (&nstehnrde,a dasr)g,s -t>isdeInndBbluofcfk,( tahrrgesa-d>Irdexc.vxb)u,f fg,r o u| p ^( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h563: | 202 : 53 : snote: tin instantiation of member function 'RunWorkElement, 3, 2>::run' requested heree pSize(n c202c | l S h m e m . c oRmumn.WbourfkfESliezmeesn[tNP(S)/.sriuzne(owfe()T;) ) | { ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp :4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916: 74: | Inote: Min instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested hereP L_COL L916_ | F U N C ( A lplrRiemdsu(cger,o uCpOTLiLdN,E Tg_rCoHuApINNt,h rSeIaMdPsL,E ,& rSeucmvP,o s&tsDeinvd,, ianrtg8s_-t>)s e n| d^b uff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g391s:-95>:r enote: cexpanded from macro 'IMPL_COLL_FUNC'v buff, | 391 ^ | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:r202k:<53n:c cnote: lin instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereF unc# #202f | u n c , t y p eR,u nFWuonrck#E#ldeemvernetdR,e dNOCpC,L _AAlLgGoO,_ #P#raoltgoo>,( )N.CrCuLn_(PwReO)T;O _ #| # ^p roto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp(:)9.:r1u:n (note: &in instantiation of member function 'RunWork, 3, 2>::run' requested heren cclS h9m | eImM.PwLo_rCkO)L;L _\F U N| C ^( AllReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :C15O:L Lnote: Nfield 'nthreads' will be initialized after field 'tidInBlock'E T_CHAI N562, | S I M PtLiEd,( tSiudm)P,o snttDhirve,a dusi(nntt6h4r_eta)d s )| ,^ tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l391o:c95k:( tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eadId x391. | x ) ,R ugnrWoourpk(d,) ,N CnCtLh_rAeLaGdOs_(#n#tahlrgeoa,d sN)C,C Lt_iPdRIOnTBOl_o#c#kp(rtohtroe>a(d)I.drxu.nx()&,n cgcrloSuhpm(egmr.owuopr)k,) ; | \ ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().In file included from run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:&1n: cIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:S10h: mIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hm:.169w: o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hr:k271):;19 :\ warning: unused variable 'ptr' [-Wunused-variable]| ^ 271 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock'u int6 4562_ | t * p ttri d=( triedc)v,P tnrt(h0r)e+aldls1(2n8tOhfrfesaedts;) , | t ^~~i dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hthread:s)562,: 15t:i dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]n Block(threadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: )field 'group' will be initialized after field 'stepSize', tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I n| B tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ock(t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~. buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15(:n twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eads), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Bloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . b| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f f S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)z es[NC C563L | _ P R O TsOt_eSpISMiPzLeE(]n/cNcClCSLh_mSeTmE.PcSo/msmi.zbeuofff(STi)z)e s{[ N C| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L _ P| R group(groupO TO_SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:]641/:N11C:C Lnote: _in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereS TEPS/si z641e | o f ( T ) ) { | p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r i m| s group(group( tid-tidStartR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:d641u:c11e:, note: nin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT hreads R641e | d u c e , d i r e cptr-i>mdso(wtni,d -&tdiidrSetcatr-t>Roeudtu,c ea,r gnsT-h>rseeanddsbRuefdfu,c ea,r gdsi-r>ercetc-v>bduofwfn,, &| d ^i rect->ou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:,202 :a53r:g snote: -in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here> sen d202b | u f f , a r g sR-u>nrWeocrvkbEulfefm,e n t| < ^F n, T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:d202O:p53,: Anote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg o, P r202o | t o > ( ) . r u nR(uwneW)o;r k E| l ^e ment, 2, 2>::run' requested herep , Al g4o | ,I MPPrLo_tCoO>L(L)_.FrUuNnC((wAel)l;R e d| u ^c e, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppL:N5E:T1_:D Inote: Rin instantiation of member function 'RunWork, 2, 2>::run' requested hereE CT, S5I | MIPMLPEL,_ CPOrLoLd_,F UiNnCt(8A_ltl)R e d| u^c e, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L391L:N95E:T _note: Dexpanded from macro 'IMPL_COLL_FUNC'I RECT, S391I | M P LREu,n WPorrokd<,n cucilnFtu8n_ct#)# f u| n^c , ty/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:e391,: 95F:u nnote: cexpanded from macro 'IMPL_COLL_FUNC'# #devre d391o | p < tRyupneW>o,r kNp(<)t.yrpuen>(,& nNcCcClLS_hAmLeGmO._w#o#rakl)g;o ,\ N C| C ^L _PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:p15r:o tnote: ofield 'nthreads' will be initialized after field 'tidInBlock'> ().run (562& | n c c l Sthimde(mt.iwdo)r,k )n;t h\r e a| d ^s (nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd InBl o562c | k ( t h rteiadd(Itdixd.)x,) ,nts(nthreahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk),( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), | 563 ^~~~~~~~~~~~~~~~~ | stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S60h:m enote: mfield 'group' will be initialized after field 'stepSize'. comm.buff S562i | z e s [ NtCiCdL(_tPiRdO)T,O _nStIhMrPeLaEd]s/(NnCtChLr_eSaTdEsP)S,/ stiizdeIonfB(lTo)c)k ({t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d I| d group(groupx .x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o677u:p11):, note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60562: | note: field 'group' will be initialized after field 'stepSize' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), | ^~~~~~~~~~~563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h391::56295::15 :note: expanded from macro 'IMPL_COLL_FUNC'warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | 562R | u n W o rtkie,a dNICdC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hxL._xA)L:,G 562O:g_15r#:o# uawarning: plinitializer order does not match the declaration order [-Wreorder-ctor](g gor,o uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ O 562T | O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) # # pt ri563do | t( ot >i (d ))s.,tr eunpntS(ih&zrneeca(dcnslc(ScnhltmShehrmme.eawdmos.r)ck,o) m;tm i.\db Iu nf| Bf ^lS oiczk/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he(:st562[h:Nr15Ce:Ca Ldnote: _Ifield 'nthreads' will be initialized after field 'tidInBlock'Pd RxO.T xO562)_ | ,S Ig Mr Po LutEpi](d/g(NrtCoiCudLp)_),S, T nE tP| hS ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r/ es ai| dz tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)se (onft(hTr )e563)a | d{ s ) ,| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tt ei pd| SI group(groupinz Bel(onc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hck:c(677ltS:hh11mr:ee manote: .dcin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereIo mdmx.. bx677u)f | f, S i g zr eo us p[ (N gC rC oLpu_rpiP)mR,sO (T tO| i ^~~~~~~~~~~~~~~~~_d S-It/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hMi:Pd562LS:Et]60a/:rN tCnote: BCfield 'group' will be initialized after field 'stepSize'cL a_sSt T,562E | Pn ST /h sr ietziaeddo(sftB(iTcd)a))s, t{ , n | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~& hd ri| er group(groupae dcst(-n>tohurte,a dd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hsi:)r,641e: c11tt:i-d >Inote: dnin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereoB lwonc,k ( a641tr | hgread I d x . x ) , g rporuipm(sg(rtoup)i,d - t| i ^~~~~~~~~~~d StartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]t idInBlock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), tidI n563B | l o c k (sttherpeSaidzIed(xn.cxc)l,S hgmreomu.pc(ogmrmo.ubpu)f,f S i| z ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e s [| N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_P R563O | T O _ S IsMtPeLpES]i/zNeC(CnLc_cSlTSEhPmSe/ms.iczoemomf.(bTu)f)f S{i z e| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~[ N C| C group(groupL _PROTO_SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:]687/:N11C:C Lnote: _in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereS TEPS/ s687i | z e o f ( T ) ) { p r| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m s (| t group(groupi d-tidStartBc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:s666t:,9 :n Tnote: hin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadsB c666a | s t , & d i r epcrti-m>so(utti,d ,n unlTlhprtera,d saGragtsh-e>rs,e nddibruefcft,- >aurpg,s -N>UrLeLc,v baurfgfs,- > s| e ^n dbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a202r:g53s:- >note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.htid:)562,: 15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads(nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d Inote: dfield 'group' will be initialized after field 'stepSize'x .x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~_ SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement (warning: )initializer order does not match the declaration order [-Wreorder-ctor]. run(we); 562 | | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:i6d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read s6( | nItMhPrLe_aCdOsL)L,_ FtUiNdCI(nABllloRcekd(utcher,e aCdOILdLxN.ExT)_,D IgRrEoCuTp,( gSrIoMuPpL)E,, P| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o d ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i nt32_ t563) | | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i391z:e95(:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Shmem. c391o | m m .RbuunfWfoSrikz), {N C C| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ A L| G group(groupO _##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C687C:L11_:P Rnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_##pr o687t | o > ( ) . r u n ( & npcrcilmSsh(mteimd.-wtoirdkS)t;a r\t B c| a ^s t, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s Bnote: cfield 'nthreads' will be initialized after field 'tidInBlock'a st, &562d | i r e c tt-i>do(utti,d )n,u lnltphtrre,a dasr(gnst-h>rseeanddsb)u,f ft,i daIrngBsl-o>crke(ctvhbruefafd,I d x| . ^x ), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : note: Rfield 'group' will be initialized after field 'stepSize'u nWork E562l | e m e n tt)(,) .triudnI(nwBel)o;c k (| t ^h readIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppx:)5,: 1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep (grou p5) | ,I M P| L ^~~~~~~~~~~_ COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), | group( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(group p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.halgo, N:C562C:L15_:P Rwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]T O_##proto>().run (562& | n c c l Sthimde(mt.iwdo)r,k nthr)e;a d\s ( n| t ^h reads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bnote: lfield 'nthreads' will be initialized after field 'tidInBlock'o ck(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ), ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~[ NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R60O:T Onote: _field 'group' will be initialized after field 'stepSize'S IMPLE ]562/ | N C C L _tSiTdE(PtSi/ds)i,z enotfh(rTe)a)d s{( n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups ), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a626d:I9d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , group (626g | r o u p ) , | p ^~~~~~~~~~~r ims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T15E:P Swarning: /initializer order does not match the declaration order [-Wreorder-ctor]s izeof(T )562) | { | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(group id), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s641(:n11t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds), t641i | d I n B l o c k ( t hprreiamdsI(dtxi.dx-)t,i dgSrtoaurpt(Rgerdouucpe),, n T| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s Reduc e563, | d i r esctte-p>Sdiozwen(,n c&cdliSrhemcetm-.>cooumtm,. baurfgfsS-i>zseesn[dNbCuCfLf_,P RaOrTgOs_-S>IrMePcLvEb]u/fNfC,C L _| S ^T EPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:f202(:T53):) note: {in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202| | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:u677n:W11o:r knote: Ein instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ement <677F | n , T , R e d O pp,r iAmlsg(ot,i dP-rtoitdoS>t(a)r.trBucna(swte,) ;n T h| r ^e adsBcas/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:,6 :&1d:i rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herec t->o u6t | ,I MdPiLr_eCcOtL-L>_dFoUwNnC,( AalrlgRse-d>usceen,d bCuOfLfL,N EaTr_gDsI-R>ErCeTc,v bSuIfMfP,L E ,| ^P rod, i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t2023:253_:t )note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWork E391l | e m eRnutn (F)u.nrcu#n#(dweev)r;e d o| p ^< type>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp,: 5N:C1C:L _note: Ain instantiation of member function 'RunWork, 2, 2>::run' requested hereL GO_ #5# | aIlMgPoL,_ CNOCLCLL__FPURNOCT(OA_l#l#Rperdoutcoe>,( )C.OrLuLnN(E&Tn_cDcIlRSEhCmTe,m .SwIoMrPkL)E;, \P r o| d ^, uint8_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:)562 : 15| :^ note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(ti d391) | , nRtuhnrWeoardks<(nnctchlrFeuandcs#)#,f utnicd,I ntBylpoec,k (Ftuhnrce#a#ddIedvxr.exd)o,p (,g rNoCuCpL)_,A L G| O ^~~~~~~~~~~~~~~~~_ ##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:,562 :N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'P ROTO_# #562p | r o t o >t(i)d.(rtuind()&,n cnctlhSrhemaedms.(wnotrhkr)e;a d\s ) ,| ^t idIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( note: tfield 'nthreads' will be initialized after field 'tidInBlock'h readI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthrea 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | S i z e ( n c c l S hpmreimm.sc(otmimd.-btuifdfSStiazretsR[eNdCuCcLe_,P RnOTThOr_eSaIdMsPRLeEd]u/cNeC,C Ld_iSrTeEcPtS-/>sdiozweno,f (&Td)i)r e{c t -| > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u t| , group(group args->sendbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :a655r:g11s:- >note: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree cvbuff ,655 | | ^ p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:i202m:s53(:t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested here- tidSt a202r | t R e d u c e , RnuTnhWroerakdEslReemdeuncte<,F nn,u lTl,p tRre,d O&pd,i rAelcgto-,> oPurto,t oa>r(g)s.-r>usne(nwdeb)u;f f ,| ^a rgs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp-:>5r:e1c:v bnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heref f, | 5 ^ | IMPL_COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:F202U:N53C:( Anote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel Reduc e202, | C O L L N E T _RDuInRWEoCrTk,E lSeImMePnLtE<,F nP,r oTd,, RueidnOtp8,_ tA)l g o| ,^ Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:>391(:)95.:r unote: nexpanded from macro 'IMPL_COLL_FUNC'( we); | 391 ^ | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppr:k8<:n1c:c lnote: Fin instantiation of member function 'RunWork, 2, 2>::run' requested hereu nc## f8u | nIcM,P Lt_yCpOeL,L _FFuUnNcC#(#AdlelvRreedduocpe<,t yCpOeL>L,N ENTC_CDLI_RAELCGTO,_ #SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreadCs(nthCrLe_aPdRsO)T,O _tSiIdMIPnLBEl]o/cNkC(CtLh_rSeTaEdPISd/xs.ixz)e,o fg(rTo)u)p ({g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ) ,| group(group | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655 :56211 | : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid(tid )655, | n t h r e a d s ( nptrhirmesa(dtsi)d,- ttiiddSItnaBrltoRcekd(utcher,e andTIhdrxe.axd)s,R egdruocuep,( gnruolulpp)t,r , | & ^~~~~~~~~~~d irect->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hod, u:i562n:t153:2 _warning: tinitializer order does not match the declaration order [-Wreorder-ctor]) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :56295 | : note: expanded from macro 'IMPL_COLL_FUNC' tid(t i391d | ) , RnutnhWroerakdp,( gNrCoCuLp_)A,L G O| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~# # a| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g o, N C563C | L _ P R OsTtOe_p#S#ipzreo(tnoc>c(l)S.hrmuenm(.&cnocmcml.SbhumfefmS.iwzoersk[)N;C C\L _ P| R ^O TO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:]15/:N Cnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'L _STEPS /562s | i z e o ft(iTd)()t i{d ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677t:i11d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(t h677r | e a d I d x . x ) , pgrriomusp((tgirdo-utpi)d,S t a| r ^~~~~~~~~~~~~~~~~t Bcas/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:,562 :n60T:h rnote: efield 'group' will be initialized after field 'stepSize'a dsBc a562s | t , & dtiirde(ctti-d>)o,u tn,t hdrieraedcst(-n>tdhorwena,d sa)r,g st-i>dsIennBdlboucfkf(,t harregasd-I>drxe.cxv)b,u fgfr,o u p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hrun(&n:c562c:l15S:h mwarning: einitializer order does not match the declaration order [-Wreorder-ctor] m.work); \ | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dnote: )field 'nthreads' will be initialized after field 'tidInBlock', nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~p Si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562(:n60c:c lnote: Sfield 'group' will be initialized after field 'stepSize'h mem. c562o | m m . b utfifdS(itzieds)[,N CnCtLh_rPeRaOdTsO(_nStIhMrPeLaEd]s/)N,C CtLi_dSITnEBPlSo/cski(ztehorfe(aTd)I)d x{. x )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ g r| o group(groupu p(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C15L:_ Awarning: Linitializer order does not match the declaration order [-Wreorder-ctor]G O_##algo ,562 | N C C L _tPiRdO(TtOi_d#)#,p rnotthor>e(a)d.sr(unnt(h&rnecacdlsS)h,m etmi.dwIonrBkl)o;c k\( t h| r ^e adId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), nt h563r | e a d s (snttehprSeiazdes()n,c ctliSdhImneBml.occokm(mt.hbruefafdSIidzxe.sx[)N,C CgLr_oPuRpO(TgOr_oSuIpM)P,L E ]| / ^~~~~~~~~~~~~~~~~N CC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T60E:P Snote: /field 'group' will be initialized after field 'stepSize's izeof (562T | ) ) { t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t i| d group(group) , nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s626(:n9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds), t i626d | I n B l o c k ( tphrriemasd(Itdixd.-xt)i,d SgtraorutpS(cgartotuepr),, n T| h ^~~~~~~~~~~r eadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :p15r:i mwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]( tid-tidS 562 | t a rttiBdc(atsitd,) ,n TnhtrheraedasdBsc(anstth,r e&addisr)e, ctti-d>IonuBtl,o cnku(ltlhprtera,d Iadrxg.sx-)>,s egnrdobuupf(fg,r oaurpg)s,- > r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c v b| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f f, | ^563 | st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:p202S:i53z:e (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec clSh m202e | m . c o m m . b uRfufnSWiozreksE[lNeCmCeLn_tPz(e)o.fr(uTn)()w e{) ; | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h1::687 :note: 11in instantiation of member function 'RunWork, 2, 2>::run' requested here: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 6 | IM P687L | _ C O L L _ F U N C (pArlilmRse(dtuicde-,t iCdOSLtLaNrEtTB_cDaIsRtE,C Tn,T hSrIeMaPdLsEB,c aPsrto,d ,& diinrte3c2t_-t>)o u t| ,^ nullpt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:,391 :a95r:g snote: -expanded from macro 'IMPL_COLL_FUNC'> sendbu f391f | , aRrugnsW-o>rrke, 2, 2>::run' requested herec ##de v202r | e d o p < t y p eR>u,n WNoCrCkLE_lAeLmGeOn_t#<#Fanl,g oT,, NRCeCdLO_pP,R OATlOg_o#,# pPrroottoo>>(())..rruunn((&wnec)c;l S h| m ^e m.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppk:)6;: 1\: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ^ 6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | :I562M:P15L:_ Cnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'L L_FUN C562( | A l l R etdiudc(et,i dC)O,L LnNtEhTr_eDaIdRsE(CnTt,h rSeIaMdPsL)E,, tPirdoIdn,B lionctk3(2t_htr)e a d| I^d x.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391g:r95o:u pnote: (expanded from macro 'IMPL_COLL_FUNC'g roup), 391 | | ^~~~~~~~~~~~~~~~~ Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hW:o562r:k60<:n cnote: cfield 'group' will be initialized after field 'stepSize'l Func# #562f | u n c , ttiydp(et,i dF)u,n cn#t#hdreevardesd(onpt),, NtCiCdLI_nABLlGoOc_k#(#tahlrgeoa,d INdCxC.Lx_)P,R OgTrOo_u#p#(pgrrootuop>)(,) . r| u ^~~~~~~~~~~n (&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tRunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hproto>:(562):.15r:u nwarning: (initializer order does not match the declaration order [-Wreorder-ctor]& ncclShmem.work )562; | \ | t ^i d(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd s(nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x )563, | g r o uspt(egprSoiuzpe)(,n c c| l ^~~~~~~~~~~~~~~~~S hmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:c562o:m60m:. bnote: ufield 'group' will be initialized after field 'stepSize'f fSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u687p:)11,: note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^~~~~~~~~~~ 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_S562I:M15P:L Ewarning: ]initializer order does not match the declaration order [-Wreorder-ctor]/ NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 : awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->sendb u562f | f , a rtgisd-(>triedc)v,b unftfh,r e a| d ^s (nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l202o:c53k:( tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eadIdx .202x | ) , g r o u p (RgurnoWuopr)k,E l e| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e n t| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)F n, T, 563R | e d O p ,s tAelpgSoi,z eP(rnoctcol>S(h)m.ermu.nc(owmem).;b u f| f ^S izes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppC:C10L:_1P:R Onote: Tin instantiation of member function 'RunWork, 2, 2>::run' requested hereO _SIM P10L | EI]M/PNLC_CCLO_LSLT_EFPUSN/Cs(iAzleloRfe(dTu)c)e ,{ C O| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L N E| T group(group_ DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :S666I:M9P:L Enote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Prod, 666h | a l f ) | ^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:(391t:i95d:, note: nexpanded from macro 'IMPL_COLL_FUNC'T hreads G391a | t h eRru,n WdoirrkeluFpu,n cN#U#LfLu,n ca,r gtsy-p>es,e nFdubnucf#f#,d eavrrgesd-o>prf,f ,N C C| L ^_ ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:#202a:l53g:o ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereN CCL _202P | R O T O _ # # p rRoutnoW>o(r)k.Erluenm(e&nntc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:)562.:r15u:n (note: wfield 'nthreads' will be initialized after field 'tidInBlock'e ); | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :t8i:d1(:t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here) , nt h8r | eIaMdPsL(_nCtOhLrLe_aFdUsN)C,( AtlildRIendBulcoec,k (CtOhLrLeNaEdTI_dDxI.RxE)C,T ,g rSoIuMpP(LgEr,o uPpr)o,d , | i ^~~~~~~~~~~~~~~~~n t6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h4:_562t:)60 : | note: ^field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid )391, | n tRhurneWaodrsk(g,r oNuCpC)L,_ A L| G ^~~~~~~~~~~O _##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.houp(gr:o562u:p15):, warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize'562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e ) ;R u n| W ^o rkElement, 2, 2>::run' requested herel go, P r10o | tIoM>P(L)_.CrOuLnL(_wFeU)N;C ( A| l ^l Reduce, COLLNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppE:C8T:,1 :S Inote: Min instantiation of member function 'RunWork, 2, 2>::run' requested hereP LE, Pr o8d | ,I MhPaLl_fC)O L L| _^F UNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:R391e:d95u:c enote: ,expanded from macro 'IMPL_COLL_FUNC' COLLNET _391D | I R ERCuTn,W oSrIkM, 391N | C C LR_uAnLWGoOr_k#<#naclcgloF,u nNcC#C#Lf_uPnRcO,T Ot_y#p#ep,r oFtuon>c(#)#.dreuvnr(e&dnocpcm,. wNoCrCkL)_;A L\G O _| # ^# algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15P:R Onote: Tfield 'nthreads' will be initialized after field 'tidInBlock'O _##pr o562t | o > ( ) .triudn((t&indc)c,l Snhtmherme.awdosr(kn)t;h r\e a d| s ^) , tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a dIdx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~h re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:I dnote: xfield 'group' will be initialized after field 'stepSize'. x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]h readIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n threa d563s | ) , t isdtIenpBSliozcek((ntchcrleSahdmIedmx..cx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho):562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h NCCL_:P562R:O15T:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# proto>().run (562& | n c c l Sthimde(mt.iwdo)r,k )n;t h\r e a| d ^s (nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, note: tfield 'nthreads' will be initialized after field 'tidInBlock'i dInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . b| u ^~~~~~~~~~~~~~~~~f fSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562s:[60N:C Cnote: Lfield 'group' will be initialized after field 'stepSize'_ PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:.626x:)9,: gnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up(gr o626u | p ) , | ^~~~~~~~~~~ prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hargs->:r562e:c15v:b uwarning: finitializer order does not match the declaration order [-Wreorder-ctor]f , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202562: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret id(t i202d | ) , n t h r e aRdusn(WnotrhkrEelaedmse)n,t r(o)u.pr(ugnr(owuep));, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :9:1: note: 563in instantiation of member function 'RunWork, 2, 2>::run' requested here | 9s | tIeMpPSLi_zCeO(LnLc_cFlUSNhCm(eAml.lcRoemdmu.cbeu,f fCSOiLzLeNsE[TN_CDCILR_EPCRTO,T OS_ISMIPMLPEL,E ]P/roNdC,C Lu_iSnTtE6P4S_/ts)i z e| o^f (T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 391{: 95 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~note: expanded from macro 'IMPL_COLL_FUNC' | group(group 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #func, t687y | p e , F u n c # # dpervirmesd(otpiS,t aNrCtCBLc_aAsLtG,O _n#T#harlegaod,s BNcCaCsLt_,P R&OdTiOr_e#c#tp-r>ootuot>,( )n.urlulnp(t&rn,c calrSghsm-e>ms.ewnodrbku)f;f ,\ a r| g ^s ->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fnote: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:(53t:i dnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nth r202e | a d s ( n t h r eRaudnsW)o,r ktEildeImneBnltou(p)).,r u n| ( ^~~~~~~~~~~~~~~~~w e)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562 :| 60 ^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp562: | 7 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid) ,7 | nItMhPrLe_aCdOsL(Ln_tFhUrNeCa(dAsl)l,R etdiudcIen,B lCoOcLkL(NtEhTr_eDaIdRIEdCxT.,x )S,I MgPrLoEu,p (Pgrroodu,p )u,i n t| 3 ^~~~~~~~~~~2 _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclbShmem.cuofmmf.SbiuzfefsS[iNzCeCsL[_NPCRCOLT_OP_RSOITMOP_LSEI]M/PNLCEC]L/_NSCTCELP_SS/TsEiPzSe/osfi(zTe)o)f ({T ) )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~{ | | group(group ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here666 :9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here687 | 666 | p r i m s ( tpirdi-mtsi(dtSitda,r tnBTcharseta,d snGTahtrheeard,s Bdciarsetc,t -&>duipr,e cNtU-L>Lo,u ta,r gnsu-l>lspetnrd,b uafrfg,s -a>rsgesn-d>bruefcfv,b uafrfg,s - >| r ^e cvbuff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 202:53: note: 202in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | 202 | R u n W o r kREulneWmoernktE (P)r.ortuon>((w)e.)r;u n (| w ^e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp::110:: 1note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | 10I | MIPMLP_LC_OCLOLL_LF_UFNUCN(CA(lAllRleRdeudcuec,e ,C OCLOLLNLENTE_TD_IDRIERCETC,T ,S ISMIPMLPEL,E ,P rPordo,d ,u ihnatl6f4)_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 391note: :expanded from macro 'IMPL_COLL_FUNC' 95: note: expanded from macro 'IMPL_COLL_FUNC'391 | Run W391o | r k o,p A,L GNOC_C#L#_aAlLgGoO,_ #N#CaClLg_oP,R ONTCOC_L#_#PpRrOoTtOo_>#(#)p.rroutno(>&(n)c.crluSnh(m&enmc.cwloSrhkm)e;m .\w o r| k ^) ; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :562 | note: field 'nthreads' will be initialized after field 'tidInBlock' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~g rou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 60 :| ^~~~~~~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 60 : note: field 'group' will be initialized after field 'stepSize't id(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~p (group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:1562:: 15note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here warning: initializer order does not match the declaration order [-Wreorder-ctor] 8 | IMPL_ C562O | L L _ F UtNiCd((AtlildR)e,d uncteh,r eCaOdLsL(NnEtTh_rDeIaRdEsC)T,, tSiIdMIPnLBEl,o c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hPkr(o:tdh,562r :ei15an:d tIwarning: 6dinitializer order does not match the declaration order [-Wreorder-ctor]4 x_.tx)) , | g^r oup (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562g: | r391 o: u95p:) , note: t expanded from macro 'IMPL_COLL_FUNC' | i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t391| i | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) d )R, u563 n | Wn ot rh kr T.,Ox _)NSCI,CMLP _LgAErL]Go/OuN_pC#(C#gLar_loSguTopE,)P ,SN /C sC| iL ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~z_ eP oR| fO tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)(T TO)_)# #{ p563 r o | | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o > (| ) group(group . rsutne(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp&:Sn641ic:zc11el:(S nhnote: cmin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herece lmS.h wm641oe | rm k. )c ; o \m m . | b ^u fpfr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hSi:im562zs:(15t:i dnote: -field 'nthreads' will be initialized after field 'tidInBlock't iedsS [t562aN | rC t R Ce Ldtu_icdPe(R,tO TinOdT_)hS,rI eMnaPtdhsLrREee]add/usNc(CenC,tL h_driSerTaedEcstP)-,S> /dtsoiiwdnIz,ne Bo&ldofic(rkTe()ct)th -r{>e oa ud| tI ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~,d x a.| rxg group(group)s ,- >gsreonudpb(ugfrfo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu :pa641)r:,g11 s :-| > ^~~~~~~~~~~~~~~~~note: r in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h c:v562b:u60f:f ,note: 641 field 'group' will be initialized after field 'stepSize' | | ^ 562 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 202 : 53 : t pinote: rdin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei( mtsi (d202t) | i, d -n t th ir de a SdRsut(annrWttohrrRkeeEadldesum)ce,en ,tt o(u)p.(rgurno(uwpe)),; | | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rect->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h),: 562 :| 15 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: warning: | initializer order does not match the declaration order [-Wreorder-ctor] tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 562 | s tteipdS(itzied()n,c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhread:s562(:n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~t idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hproto>:(562):.15r:u nwarning: (initializer order does not match the declaration order [-Wreorder-ctor]& ncclShmem.work )562; | \ | t ^i d(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x )563, | g r o uspt(egprSoiuzpe)(,n c c| l ^~~~~~~~~~~~~~~~~S hm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:c60o:m mnote: .field 'group' will be initialized after field 'stepSize'b uffS i562z | e s [ N CtCiLd_(PtRiOdT)O,_ SnItMhPrLeEa]d/sN(CnCtLh_rSeTaEdPsS)/,s itziedoIfn(BTl)o)c k{( t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| I group(groupd x.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p655):,11 : | note: ^~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ( T202) | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn WorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereA lgo, P r641o | t o > ( ) . r u n ( wper)i;m s (| t ^i d-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppa:r8t:R1e:d unote: cin instantiation of member function 'RunWork, 2, 2>::run' requested heree , nT h8r | eIaMdPsLR_eCdOuLcLe_,F UdNiCr(eAcltl-R>eddouwcne,, &CdOiLrLeNcEtT-_>DoIuRtE,C Ta,r gSsI-M>PsLeEn,d bPurfofd,, airngts6-4>_rte)c v b| u^f f, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :39153 | : note: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu nWork <202n | c c l F u n c # #RfuunnWco,r ktEylpeem,e nFtuo,, NPCrCoLt_oA>L(G)O._r#u#na(lwgeo),; N C| C ^L _PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp#:#9p:r1o:t onote: >in instantiation of member function 'RunWork, 2, 2>::run' requested here( ).ru n9( | &InMcPcLl_SChOmLeLm_.FwUoNrCk()A;l l\R e d| u ^c e, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562N:E15T:_ Dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'R ECT, S562I | M P L E ,t iPdr(otdi,d )u,i nntt6h4r_eta)d s (| n^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391):,95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'I nBlock (391t | h r eRaudnIWdoxr.kx<)n,c cglrFouunpc(#g#rfouunpc),, t y| p ^~~~~~~~~~~~~~~~~e , F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562c:#60#:d enote: vfield 'group' will be initialized after field 'stepSize'r edop <562t | y p e > ,t iNdC(CtLi_dA)L,G On_t#h#raelagdos,( nNtChCrLe_aPdRsO)T,O _t#i#dpIrnoBtloo>c(k)(.trhurne(a&dnIcdcxl.Sxh)m,e mg.rwoourpk()g;r o\u p )| , ^ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h stepS:i562z:e15(:n cwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]l Shmem.com m562. | b u f f Stiizde(st[iNdC)C,L _nPtRhOrTeOa_dSsI(MnPtLhEr]e/aNdCsC)L,_ StTiEdPISn/Bsliozceko(ft(hTr)e)a d{I d x| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x ) ,| group(groupg roup(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,626 : 9| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 626 | 563 | s t epprSiimzse((tnicdc-ltSihdmSetma.rctoSmcma.tbtuefrf,S inzTehsr[eNaCdCsLS_cPaRtOtTeOr_,S INMUPLLLE,] /dNiCrCeLc_tS-T>EuPpS,/ sairzgeso-f>(sTe)n)d b{u f f| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a r| g group(groups ->recvbuff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^: 655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: 655note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s (RtuindW-otrikdESlteamretnRtep(t)r.,r u&nd(iwree)c;t - >| o ^u t, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpps:-10>:s1e:n dnote: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ff, 10a | rIgMsP-L>_rCeOcLvLb_uFfUfN,C ( A| llRe ^d uce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:N202E:T53_:D Inote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE CT, S202I | M P L E , P r oRdu,n WhoarlkfE)l e m| e^n tr(k)<.nrcucnl(Fwuen)c;# # f| u ^n c, typ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppe:,11 :F1u:n cnote: #in instantiation of member function 'RunWork, 2, 2>::run' requested here# dev r11e | dIoMpPL,_ FNUCNCCL(_AAlLlGROe_d#u#cael,g oC,O LNLCNCELT__PDRIORTEOC_T#,# pSrIoMtPoL>E(,) .Prruond(,& nfclcolaSth)m e m| .^w ork);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :\391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562391: | 15 : Rnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n Work< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~~~~~~~t o>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:r562u:n60(:& nnote: cfield 'group' will be initialized after field 'stepSize'c lShme m562. | w o r k )t;i d\( t i| d ^) , nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h read s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~k (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hmem.w:o562r:k15):; warning: \initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), 563| | ^~~~~~~~~~~~~~~~~ s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S60i:z enote: (field 'group' will be initialized after field 'stepSize'n cclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tis->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562m:.15w:o rwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]) ; \ | ^ 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)15,: nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~~~~~~~( ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m60e:m .note: cfield 'group' will be initialized after field 'stepSize'o mm.b u562f | f S i z etsi[dN(CtCiLd_)P,R OnTtOh_rSeIaMdPsL(En]t/hNrCeCaLd_sS)T,E PtSi/dsIinzBeloofc(kT()t)h r{e a d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d x .| x group(group) , group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:g687r:o11u:p )note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^~~~~~~~~~~ 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElementi(d)(.triudn)(,w en)t;h r e| a ^d s(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp):,9 :t1i:d Inote: nin instantiation of member function 'RunWork, 2, 2>::run' requested hereB lock(t h9r | eIaMdPILd_xC.OxL)L,_ FgUrNoCu(pA(lglrRoeudpu)c,e , | C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O L L| N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)E T_DI R563E | C T , SsItMePpLSEi,z eP(rnocdc,l Suhimnetm6.4c_otm)m . b| u^f fSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h[:N391C:C95L:_ Pnote: Rexpanded from macro 'IMPL_COLL_FUNC'O TO_SIMPL E391] | / N CRCuLn_WSoTrEkP, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herey pe>, N655C | C L _ A L G prOi_m#s#(atligdo-,t iNdCSCtLa_rPtRROeTdOu_c#e#,p rnoTthor>e(a)d.srRuend(u&cnec,c lnSuhlmlepmt.rw,o r&kd)i;r e\c t -| > ^o ut, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-15>:s enote: nfield 'nthreads' will be initialized after field 'tidInBlock'd buff, a562r | g s - > rteicdv(btuifdf),, n| t ^h reads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t202h:r53e:a dnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , ti d202I | n B l o c k ( t hRruenaWdoIrdkxE.lxe)m,e ngtrnote: (field 'group' will be initialized after field 'stepSize') .run(w e562) | ; | ^t id(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:)10,: 1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads( n10t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ICdOxL.LxN)E,T _gDrIoRuEpC(Tg,r oSuIpM)P,L E ,| ^~~~~~~~~~~P rod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 562 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | t i563d | ( t i d )s,t enptShirzeea(dnsc(cnltShhrmeeamd.sc)o,m mt.biudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~i zeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53 :562 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ti d202( | t i d ) , n t hRruenaWdosr(knEtlhermeeandts<)F,n ,t iTd,I nRBeldoOcpk,( tAhlrgeoa,d IPdrxo.txo)>,( )g.rrouunp((wger)o;u p )| , ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :9:1: 563note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here s t9e | pISMiPzLe_(CnOcLcLl_SFhUmNeCm(.AclolmRme.dbuucfef,S iCzOeLsL[NNECTC_LD_IPRREOCTTO,_ SSIIMMPPLLEE],/ NPCrCoLd_,S TuEiPnSt/6s4i_zte)o f (| T^) ) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~391 : 95| : group(group note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 666391: | 9 : Rnote: uin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Work <666n | c c l F u n c # #pfruinmcs,( ttiydp,e ,n TFhurnec##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562s:[15N:C Cwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ PROTO_SIMPL E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:k641(:t11h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered Idx.x )641, | g r o u p ( g r o uppr)i,m s (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d -| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i dS t563a | r t R e dsutceep,S inzTeh(rnecacdlsSRhemdeumc.ec,o mdmi.rbeucftf-S>idzoewsn[,N C&CdLi_rPeRcOtT-O>_oSuItM,P LaEr]g/sN-C>CsLe_nSdTbEuPfSf/,s iazregosf-(>Tr)e)c v{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h202::67753::11 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 202 | 677 | R u n W oprrkiEmlse(mteindt-s(B)c.arsutn,( w&ed)i;r e c| t ^- >out, dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppe:c9t:-1>:d onote: win instantiation of member function 'RunWork, 2, 2>::run' requested heren , ar g9s | -I>MsPeLn_dCbOuLfLf_,F UaNrCg(sA-l>lrReecdvubcuef,f ,C O L| L ^N ET_DIREC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:,202 :S53I:M Pnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE , Pr o202d | , u i n t 6 4 _Rtu)n W o| r^k Eleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391<:F95n:, note: Texpanded from macro 'IMPL_COLL_FUNC', RedOp ,391 | A l gRou,n WPorroktc(c)l.Fruunnc(#w#ef)u;n c ,| ^t ype, F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppu:n10c:#1#:d enote: vin instantiation of member function 'RunWork, 2, 2>::run' requested herer edop <10t | yIpMeP>L,_ CNOCLCLL__FAULNGCO(_A#l#laRlegdou,c eN,C CCLO_LPLRNOETTO__D#I#RpErCoTt,o >S(I)M.PrLuEn,( &Pnrcocdl,S hhmaelmf.)w o r| k^) ; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 391note: | field 'nthreads' will be initialized after field 'tidInBlock' RunWo r562k | < n c c ltFiudn(ct#i#df)u,n cn,t htryepaed,s (Fnutnhcr#e#addesv)r,e dtoipdc,k (NtChCrLe_aAdLIGdOx_.#x#)a,l ggor,o NuCpC(Lg_rPoRuOpT)O,_ # #| p ^~~~~~~~~~~~~~~~~r ot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:>562(:)60.:r unote: nfield 'group' will be initialized after field 'stepSize'( &ncc l562S | h m e m .twiodr(kt)i;d )\, n| t ^h reads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock') , ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~I nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 202 | 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWorkE l562e | m e n t s()),. rtuind(IwneB)l;o c k| ( ^t hreadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:x9.:x1):, note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup( g9r | oIuMpP)L,_ C O| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ F| U tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N C(All R563e | d u c e ,s tCeOpLSLiNzEeT(_nDcIcRlESChTm,e mS.IcMoPmLmE.,b uPfrfoSdi,z eusi[nNtC6C4L__tP)R O T| O^_ SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:]391/:N95C:C Lnote: _expanded from macro 'IMPL_COLL_FUNC'S TEPS/si z391e | o fR(uTn)W)o r{k < n| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c l F| u group(groupn c##func, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:y677p:e11,: Fnote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren c##dev r677e | d o p < t y p e > , pNrCiCmLs_(AtLiGdO-_t#i#daSltgaor,t BNcCaCsLt_,P RnOTThOr_e#a#dpsrBoctaos>t(,) .&rduinr(e&cntc-c>loSuhtm,e md.iwroerckt)-;> d\o w n| , ^ args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:e562n:d15b:u fnote: ffield 'nthreads' will be initialized after field 'tidInBlock', args -562> | r e c v btuifdf(,t i d| ) ^, nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:s53(:n tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eads )202, | t i d I n B l oRcukn(WtohrrkeEaldeImdexn.tx<)F,n ,g rTo,u pR(egdrOopu,p )A,l g o| , ^~~~~~~~~~~~~~~~~ Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>60(:) .note: rfield 'group' will be initialized after field 'stepSize'u n(we) ;562 | | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:i8d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read s8( | nItMhPrLe_aCdOsL)L,_ FtUiNdCI(nABllloRcekd(utcher,e aCdOILdLxN.ExT)_,D IgRrEoCuTp,( gSrIoMuPpL)E,, P| r ^~~~~~~~~~~o d, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)15,: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d ( tsitde)p,S inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBelso[cNkC(CtLh_rPeRaOdTIOd_xS.IxM)P,L Eg]r/oNuCpC(Lg_rSoTuEpP)S,/ s i| z ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e o f| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T )) { | 563 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:c687l:S11h:m enote: min instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. comm. b687u | f f S i z e s [ N C CpLr_iPmRsO(TtOi_dS-ItMiPdLSEt]a/rNtCBCcLa_sStT,E PnST/hsriezaedosfB(cTa)s)t ,{ & d| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e c| t group(group- >out, nul/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:p666t:r9,: anote: rin instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg s->sen d666b | u f f , a r g sp-r>irmesc(vtbiudf,f ,n T h| r ^e adsGa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202e:r53,: dnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ect -202> | u p , N U L L ,R uanrWgosr-k>EsleenmdebnutferdeOcpv,b uAflfg,o , | P ^r oto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(202w:e53):; note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp : 9 : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereR unWo r9k | EIlMePmLe_nCtO_(D)I.RrEuCnT(,w eS)I;M P L| E ^, Prod, ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppn:t106:41_:t )note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^ 10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | :I391M:P95L:_ Cnote: Oexpanded from macro 'IMPL_COLL_FUNC'L L_FUN C391( | A l lRRuendWuocrek,< nCcOcLlLFNuEnTc_#D#IfRuEnCcT,, tSyIpMeP,L EF,unc##d ePvrroedd,o ph ,| ^N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:L391G:O95_:# #note: aexpanded from macro 'IMPL_COLL_FUNC'l go, NCC L391_ | P R ORTuOn_W#o#rpkrl(F)u.nrcu#n#(f&unnccc,l Sthympeem,. wFournkc)#;# d\e v r| e ^d op15,: Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_AL G562O | _ # # a ltgiod,( tNiCdC)L,_ PnRtOhTrOe_a#d#sp(rnotthor>e(a)d.sr)u,n (t&indcIcnlBSlhomcekm(.twhorreka)d;I d\x . x| ) ^, group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o15u:p )note: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 60 : tnote: ifield 'group' will be initialized after field 'stepSize'd (tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~u p(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)60,: note: | field 'group' will be initialized after field 'stepSize' ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g roup), | ^~~~~~~~~~~~~~~~~562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t60i:d (note: tfield 'group' will be initialized after field 'stepSize'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p(gro u563p | ) , | s ^~~~~~~~~~~t epSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ecvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorke,a dNsC(CnLt_hArLeGaOd_s#)#,a ltgiod,I nNBClCoLc_kP(RtOhTrOe_a#d#Ipdrxo.txo)>,( )g.rrouunp((&gnrcoculpS)h,m e m| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~w o r| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) ; \ | 563 ^ | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S15i:z enote: (field 'nthreads' will be initialized after field 'tidInBlock'n cclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~~~~~~~: 687:11/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here60 : note: field 'group' will be initialized after field 'stepSize' 687 | 562 | t ipdr(itmisd()t,i dn-tthirdeSatdasr(tnBtcharseta,d sn)T,h rteiaddIsnBBclaosctk,( t&hdrieraedcItd-x>.oxu)t,, gnruolulpp(tgrr,o uapr)g,s - >| s ^~~~~~~~~~~e ndbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]I dx.x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rect->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hROTO_#:#562p:r15o:t owarning: >initializer order does not match the declaration order [-Wreorder-ctor]( ).run(&ncclShm e562m | . w o r kt)i;d (\t i d| ) ^, nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~~~~~~~o mm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:b562u:f60f:S inote: zfield 'group' will be initialized after field 'stepSize'e s[NC C562L | _ P R O TtOi_dS(ItMiPdL)E,] /nNtChCrLe_aSdTsE(PnSt/hsriezaedosf)(,T )t)i d{I n B| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o c k| ( group(groupt hreadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p666(:g9r:o unote: pin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:>562(:)15.:r uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]( &ncclShmem .562w | o r k ) ;t i\d ( t| i ^d ), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: (field 'nthreads' will be initialized after field 'tidInBlock'n thre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a dIdx .563x | ) , g rsotuepp(Sgirzoeu(pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:m562m:.60b:u fnote: ffield 'group' will be initialized after field 'stepSize'S izes [562N | C C L _ PtRiOdT(Ot_iSdI)M,P LnEt]h/rNeCaCdLs_(SnTtEhPrSe/asdisz)e,o ft(iTd)I)n B{l o c| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgrou:p562):,15 : | warning: ^~~~~~~~~~~initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562 | : 562 :t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthreads(n t562h | r e a d stid(t)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~x .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork , NtCiCdL(_tAiLdG)O,_ #n#tahlrgeoa,d sN(CnCtLh_rPeRaOdTsO)_,# #tpirdoItnoB>l(o)c.kr(utnh(r&enacdcIldSxh.mxe)m,. wgorroku)p;( g\r o u| p ^) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s t e ptSiidz(et(indc)c,l Snhtmherme.acdosm(mn.tbhurfefaSdisz)e,s [tNiCdCILn_BPlRoOcTkO(_tShIrMePaLdEI]d/xN.CxC)L,_ SgTrEoPuSp/(sgirzoeuopf)(,T ) )| ^~~~~~~~~~~~~~~~~{ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 60 group(group: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :562 | 687 : 11 : tnote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid), 687n | t h r e a d s ( n t hprreiamdss()t,i dt-itdiIdnSBtlaorctkB(ctahsrte,a dnITdhxr.exa)d,s Bgcraosutp,( g&rdoiurpe)c,t - >| o ^~~~~~~~~~~u t, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(t:i562d:)15,: nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads(nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 563: | 562 : 15 : swarning: tinitializer order does not match the declaration order [-Wreorder-ctor]e pSize(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h563: | 626 : 9 : snote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree pSize (626n | c c l S h m e m .pcroimmms.(btuifdf-StiizdeSst[aNrCtCSLc_aPtRtOeTrO,_ SnITMhPrLeEa]d/sNSCcCaLt_tSeTrE,P SN/UsLiLz,e odfi(rTe)c)t -{> u p| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a r| g group(groups ->sendbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:,666 :a9r:g snote: -in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here> recv b666u | f f , | ^ prims(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202,: 53n:T hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree adsG a202t | h e r , d i r eRcutn-W>ourpk,E lNeUmLeLn,t ,s eRneddbOupf,f ,A lagrog,s -P>rroetcov>b(u)f.fr,u n (| w ^e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppnote: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here11 :1: note: 202in instantiation of member function 'RunWork, 2, 2>::run' requested here | 11 | I MRPuLn_WCoOrLkLE_lFeUmNeCn(tAS(I)M.PrLuEn,( wPer)o;d , | f ^l oat) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1 :| ^note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :11391 | :I95M:P Lnote: _expanded from macro 'IMPL_COLL_FUNC'C OLL_FU N391C | ( A lRluRneWdourcke<,n cCcOlLFLuNnEcT#_#DfIuRnEcC,T ,t ySpIeM,P LFEu,n cP#r#odde,v rfeldooapt<)t y p| e^> , NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391A:L95G:O _note: #expanded from macro 'IMPL_COLL_FUNC'# algo, 391N | C C LR_uPnRWOoTrOk_<#n#cpcrloFtuon>c(#)#.fruunnc(,& ntcycpleS,h mFeumn.cw#o#rdke)v;r e\d o p| < ^t ype>, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:A Lnote: Gfield 'nthreads' will be initialized after field 'tidInBlock'O _##al g562o | , N C CtLi_dP(RtOiTdO)_,# #nptrhorteoa>d(s)(.nrtuhnr(e&andcsc)l,S htmiedmI.nwBolrokc)k;( t\h r e| a ^d Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pnote: (field 'nthreads' will be initialized after field 'tidInBlock'g roup), 562 | | ^~~~~~~~~~~~~~~~~ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t60i:d )note: ,field 'group' will be initialized after field 'stepSize' nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~. x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u60p:( gnote: rfield 'group' will be initialized after field 'stepSize'o up), 562| | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S15i:z ewarning: (initializer order does not match the declaration order [-Wreorder-ctor]n cclShmem.com m562. | b u f f Stiizde(st[iNdC)C,L _nPtRhOrTeOa_dSsI(MnPtLhEr]e/aNdCsC)L,_ StTiEdPISn/Bsliozceko(ft(hTr)e)a d{I d x| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x ) ,| group(groupg roup(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 677 :| 11 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 677 | s t e p S i z e ( npcrcilmSsh(mteimd.-ctoimdmS.tbaurftfBSciazsets,[ NnCTChLr_ePaRdOsTBOc_aSsItM,P L&Ed]i/rNeCcCtL-_>SoTuEtP,S /dsiirzeecotf-(>Td)o)w n{, a| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g s -| > group(groups endbuff, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>:r641e:c11v:b unote: fin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref , | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ims (202t | i d - t i d S t aRrutnRWeodrukcEel,e mneTnhtr dPorwont,o >&(d)i.rreucnt(-w>eo)u;t , | a ^r gs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp-:>11s:e1n:d bnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heref f, a11r | gIsM-P>Lr_eCcOvLbLu_fFfU,N C (| A ^l lReduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202C:O53L:L Nnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT _DIRE C202T | , S I M P L E ,R uPnrWoodr,k Eflleomaetn)t < F| n^, T,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391e:d95O:p ,note: expanded from macro 'IMPL_COLL_FUNC'A lgo, 391P | r o tRou>n(W)o.rrku, 2, 2>::run' requested here# #dev r11e | dIoMpPL,_ FNUCNCCL(_AAlLlGROe_d#u#cael,g oC,O LNLCNCELT__PDRIORTEOC_T#,# pSrIoMtPoL>E(,) .Prruond(,& nfclcolaSth)m e m| .^w ork)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 391\: 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562391: | 15 : Rnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n Work <562n | c c l F utnicd#(#tfiudn)c,, nttyhpree,a dFsu(nnct#h#rdeeavdrse)d,o ptl,o cNkC(CtLh_rAeLaGdOI_d#x#.axl)g,o ,g rNoCuCpL(_gPrRoOuTpO)_,# # p| r ^~~~~~~~~~~~~~~~~o to>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:)562.:r60u:n (note: &field 'group' will be initialized after field 'stepSize'n cclS h562m | e m . w otrikd)(;t i\d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p S i| z tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e (nccl S563h | m e m . csotmem.pbSuifzfeS(inzcecsl[SNhCmCeLm_.PcRoOmTmO._bSuIfMfPSLiEz]e/sN[CNCCLC_LS_TPERPOST/Os_iSzIeMoPfL(ET])/)N C{C L | _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S T E| P group(groupS /sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :{626 : 9| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| group(group 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 687 :p11r:i mnote: sin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid-t i687d | S t a r t S c a t t eprr,i mnsT(htrieda-dtsiSdcSattatretrB,c aNsUtL,L ,n Tdhirreeacdts-B>cuaps,t ,a r&gdsi-r>escetn-d>bouuftf,, naurlglsp-t>rr,e cavrbgusf-f>,s e n| d ^b uff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g202s:-53>:r enote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herev buff ,202 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:u202n:W53o:r knote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel eme n202t | < F n , T , RReudnOWpo,r kAEllgeom,e nPtr (T),. rRuend(Owpe,) ;A l g| o ^, Proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:)11.:r1u:n (note: win instantiation of member function 'RunWork, 2, 2>::run' requested heree ); | ^11 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppL:_12C:O1L:L _note: Fin instantiation of member function 'RunWork, 2, 2>::run' requested hereU NC( A12l | lIRMePdLu_cCeO,L LC_OFLULNNCE(TA_lDlIRReEdCuTc,e ,S ICMOPLLLEN,E TP_rDoIdR,E CfTl,o aStI)M P L| E^, Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391d:o95u:b lnote: eexpanded from macro 'IMPL_COLL_FUNC') | ^ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 :R95u:n Wnote: oexpanded from macro 'IMPL_COLL_FUNC'r k#,# dNeCvCrLe_dAoLpGa,l gNoC,C LN_CACLLG_OP_R#O#TaOl_g#o#,p rNoCtCoL>_(P)R.OrTuOn_(#&#npcrcoltSoh>m(e)m..rwuonr(k&)n;c c\l S h| m ^e m.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):;562 :\15 : | note: ^field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock't id(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~p (grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 60 :| ^~~~~~~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 562note: | field 'group' will be initialized after field 'stepSize' t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC L562_ | A L GO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hreadI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(group), 562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d(tid )563, | n t h rsetaedpsS(inzteh(rnecacdlsS)h,m etmi.dcIonmBml.obcukf(ftShirzeeasd[INdCxC.Lx_)P,R OgTrOo_uSpI(MgPrLoEu]p/)N,C C L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S T E| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S /size o563f | ( T ) ) s{t e p| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i z e| ( group(groupn cclShmem.com/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:.655b:u11f:f Snote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez es[NC C655L | _ P R O T O _ S I M PpLrEi]m/sN(CtCiLd_-StTiEdPSSt/asritzReeodfu(cTe),) n{T h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| R group(groupe duce, nullptr,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :&666d:i9r:e cnote: tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >out ,666 | a r g s - > s e npdrbiumfsf(,t iadr,g sn-T>hrreecavdbsuGfaft,h e r| , ^ direct->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202,: 53N:U Lnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, arg s202- | > s e n d b u f fR,u naWrogrsk-E>lreemcevnbtu, 2, 2>::run' requested heret o>() .202r | u n ( w e ) ; R| u ^n WorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppm:e11n:t1<:F nnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here T, R e11d | OIpM,P LA_lCgOoL,L _PFrUoNtCo(>A(l)l.Rreudnu(cwee,) ;C O L| L ^N ET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :S11I:M1P:L Enote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here Prod ,11 | fIlMoPaLt_)C O L| L^_ FUNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:l391l:R95e:d unote: cexpanded from macro 'IMPL_COLL_FUNC'e , COLL N391E | T _ DRIuRnEWCoTr,k , 391N | C C LR_uAnLWGoOr_k#<#naclcgloF,u nNcC#C#Lf_uPnRcO,T Ot_y#p#ep,r oFtuon>c(#)#.dreuvnr(e&dnocpcm,. wNoCrCkL)_;A L\G O _| # ^# algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15P:R Onote: Tfield 'nthreads' will be initialized after field 'tidInBlock'O _##prot o562> | ( ) . r utni(d&(ntcicdl)S,h mnetmh.rweoardks)(;n t\h r e| a ^d s), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562I:n15B:l onote: cfield 'nthreads' will be initialized after field 'tidInBlock'k (thre a562d | I d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s (| n ^~~~~~~~~~~~~~~~~t hre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)60,: tnote: ifield 'group' will be initialized after field 'stepSize'd InBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s (warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o m m| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)b uffSi z563e | s [ N C CsLt_ePpRSOiTzOe_(SnIcMcPlLSEh]m/eNmC.CcLo_mSmT.EbPuSf/fsSiizzeeosf[(NTC)C)L _{P R O| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O _ S| I group(groupM PLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:T626E:P9S:/ snote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez eof(T )626) | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | p group(groupr ims(tid-ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:S641t:a11r:t Snote: cin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea tter, n641T | h r e a d s S c a t tperri,m sN(UtLiLd,- tdiidrSect-t>aurpt,R eadrugcse-,> sneTnhdrbeuafdfs,R eadrugcse-,> rdeicrvebcutf-f>,d o w| n ^, &dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:c202t:-53>:o unote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, arg s202- | > s e n d b u f fR,u naWrogrsk-E>lreemcevnbtu, 2, 2>::run' requested hereo >(). r202u | n ( w e ) ; | R ^u nWorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppm:e12n:t1<:F nnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here T, R e12d | OIpM,P LA_lCgOoL,L _PFrUoNtCo(>A(l)l.Rreudnu(cwee,) ;C O L| L ^N ET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppE:C12T:,1 :S Inote: Min instantiation of member function 'RunWork, 2, 2>::run' requested hereP LE, 12P | rIoMdP,L _dCoOuLbLl_eF)U N C| (^A llRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:c391e:,95 :C Onote: Lexpanded from macro 'IMPL_COLL_FUNC'L NET_DIR E391C | T , RSuInMWPoLrEk,< nPcrcoldF,u ndco#u#bfluen)c , | t^y pe,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :F391u:n95c:# #note: dexpanded from macro 'IMPL_COLL_FUNC'e vredo p391< | t y pReu>n,W oNrCkCd(o)p.n,c cNlCSChLm_eAmL.GwOo_r#k#)a;l g\o , | N ^C CL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_15#:# pnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o to>(). r562u | n ( & n ctcildS(htmiedm).,w onrtkh)r;e a\d s (| n ^t hre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd InBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~) , t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562I:n60B:l onote: cfield 'group' will be initialized after field 'stepSize'k (thre a562d | I d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s (| n ^~~~~~~~~~~~~~~~~t hr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60):, note: tfield 'group' will be initialized after field 'stepSize'i dInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15]:/ Nwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]C L_STEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:)655,: 11t:idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx), g:r562o:u15p:( gwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL)E,] /gNrCoCuLp_(SgTrEoPuSp/)s,i z e| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f ( T| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~563 | | group(group stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hz:e641(:n11c:c lnote: Sin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh mem.c o641m | m . b u f f S i z e sp[rNiCmCsL(_tPiRdO-TtOi_dSSItMaPrLtER]e/dNuCcCeL,_ SnTTEhPrSe/asdiszReeodfu(cTe),) d{i r e| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t - >| d group(groupo wn, &direct->o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:t626,: 9a:r gnote: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >send b626u | f f , a r g s -p>rriemcsv(btuifdf-,t i d| S ^t artSca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:t202e:r53,: nnote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh read s202S | c a t t e r , NRUuLnLW,o rdkiErleecmte-n>tuesdeOnpd,b uAflfg,o ,a rPgrso-t>or>e(c)v.bruufnf(,w e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp202::1253::1 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 202 | 12 | I M P L _ CROuLnLW_oFrUkNECl(eAmlelnRteL(E),. rPurno(dw,e )d;o u b| l ^e ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::1391:: 95note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: expanded from macro 'IMPL_COLL_FUNC' 12 | I M391P | L _ CROuLnLW_oFrUkN ,d oNuCbClLe_)A L G| O^_ ##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391N:C95C:L _note: Pexpanded from macro 'IMPL_COLL_FUNC'R OTO_## 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'nthreads' will be initialized after field 'tidInBlock'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~r oup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 : 60| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize'| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d (sttiedp)S,i znet(hnrcecaldSsh(mnetmh.rceoamdms.)b,u ftfiSdiIznesB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL), grouEp](/gNrCoCuLp_)S,T E P| S ^~~~~~~~~~~/ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562C:O15L:L _warning: Finitializer order does not match the declaration order [-Wreorder-ctor]U NC(AllReduce, 562C | O L L N EtTi_dD(ItRiEdC)T,, nStIhMrPeLaEd,s (Pnrtohdr,e ardcsc)l,_ btfildoIantB1l6o)c k (| t^h readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391.:x95):, note: gexpanded from macro 'IMPL_COLL_FUNC'r oup(gro u391p | ) , R u| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~W o r| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)< nccl F563u | n c # # fsutnecp,S itzyep(en,c cFluSnhcm#e#md.ecvormemd.obpue,s [NNCCCCLL__APLRGOOT_O#_#SaIlMgPoL,E ]N/CNCCLC_LP_RSOTTEOP_S#/#spirzoetoof>((T)).)r u{n ( &| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c c l| S group(grouph mem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :\677 : 11| : ^ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :67715 | : note: field 'nthreads' will be initialized after field 'tidInBlock' p562r | i m s ( ttiidd-(ttiiddS)t,a rnttBhcraesatd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,s: (562nn:Tt15h:hrr eewarning: aainitializer order does not match the declaration order [-Wreorder-ctor]dd ssB)c,a st562ti | ,d I &n dB iltroiecdck(t(tt-ih>dro)eu,at d,nI tddhxir.rexea)cd,ts -(g>nrdtoohurwpen(a,gd rsao)ru,gp s)t-,i> ds Ie| nn ^~~~~~~~~~~~~~~~~Bd lbou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hcf:kf562(,:t 60ha:rr egnote: asfield 'group' will be initialized after field 'stepSize'd- I>drxe .c562xv | )b ,u f gf r, t i| d ^( tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t202h:r53e:a dnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( nth r202e | a d s ) , t i dRIunnBWloorckkE(ltehmreenatd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Idx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:T562_:D15I:R Ewarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]T , SIMPLE, 562P | r o d , tricdc(lt_ibdf)l,o antt1h6r)e a d| s^( nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBlo c391k | ( t hRruenaWdoIrdkx<.nxc)c,l Fgurnocup#(#gfruonucp,) ,t y p| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, F| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n c##de v563r | e d o p i,z eN(CnCcLc_lASLhGmOe_m#.#caolmgmo.,b uNfCfCSLi_zPeRsO[TNOC_C#L#_pPrRoOtToO>_(S)I.MrPuLnE(]&/nNcCcClLS_hSmTeEmP.Sw/osrikz)e;o f\( T )| ) ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 group(group: 15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :562655 | : 11 : note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d(tid) ,655 | n t h r e a d s ( n tphrriemasd(st)i,d -ttiiddISntBalrotcRke(dtuhcree,a dnITdhxr.exa)d,s Rgerdouucpe(,g rnouulpl)p,t r ,| ^~~~~~~~~~~~~~~~~& direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:-562>:o60u:t ,note: field 'group' will be initialized after field 'stepSize'a rgs->se n562d | b u f f ,t iadr(gtsi-d>)r,e cnvtbhurfefa,d s (| n ^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202):,53 :t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI nBl o202c | k ( t h r e a d IRduxn.Wxo)r,k Eglreomuepn(tg().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:_562S:T15E:P Swarning: /initializer order does not match the declaration order [-Wreorder-ctor]s izeof(T)) 562{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d677s:(11n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads), t677i | d I n B l o c k ( t hprreiamdsI(dtxi.dx-)t,i dgSrtoaurpt(Bgcraosutp,) ,n T h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B cast, 563& | d i r e cstt-e>poSuitz,e (dnicrcelcSth-m>edmo.wcno,m ma.rbgusf-f>Ssieznedsb[uNfCfC,L _aPrRgOsT-O>_rSeIcMvPbLuEf]f/,N C C| L ^_ STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:i202z:e53o:f (note: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) ) { 202| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:R562e:d15u:ce ,warning: initializer order does not match the declaration order [-Wreorder-ctor]n ThreadsReduce, 562n | u l l p ttri,d (&tdiidr)e,c tn-t>horueta,d sa(rngtsh-r>esaednsd)b,u ftfi,d IanrBglso-c>kr(etchvrbeuafdfI,d x .| x ^) , gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202(:g53r:o unote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , | 202n | d b u f f ,R uanrWgosr-k>Erleecmvebnutf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h():.202r:u53n:( wnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) ; | ^ 202 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppu:n4W:o1r:k Enote: lin instantiation of member function 'RunWork, 2, 2>::run' requested heree ment <4F | nI,M PTL,_ CROeLdLO_pF,U NACl(gAol,l RPerdoutcoe>,( )C.OrLuLnN(EwTe_)D;I R E| C ^T , SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppE:,5 :S1u:m ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested herei nt8_t )5 | I| M^P L_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391F:U95N:C (note: Aexpanded from macro 'IMPL_COLL_FUNC'l lRedu c391e | , CROuLnLWNoErTk_:,95 :N Cnote: Cexpanded from macro 'IMPL_COLL_FUNC'L _ALGO_# #391a | l g oR,u nNWCoCrLk_c(,) .tryupne(,& nFcucnlcS#h#mdeemv.rweodrokp)<;t y\p e >| , ^ NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hG:O562_:#15#:a lnote: gfield 'nthreads' will be initialized after field 'tidInBlock'o , NCC L562_ | P R O T Ot_i#d#(ptriodt)o,> (n)t.hrruena(d&sn(cnctlhSrhemaedms.)w,o rtki)d;I n\B l o| c ^k (threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~i d),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's (nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~r eadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x60):, note: gfield 'group' will be initialized after field 'stepSize'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnet/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hahd:rs562e):,a15 d:ts iwarning: )dinitializer order does not match the declaration order [-Wreorder-ctor],I ntBild oI562cn | kB ( lt ho rtceiakdd((Ittdhixrd.)ex,a )n,td hgIrrdeoxau.dpx(s)g(r,no tughprr)e,oa du s| p) ^~~~~~~~~~~~~~~~~(, g r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hto:iu562d:pI60)n:,B l note: ofield 'group' will be initialized after field 'stepSize' c | k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t 562h | r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) a d Itdixd. (x563t) | i,d )g, r o nut phs(rgteraeodpusSp(i)n,zt he r(| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~na cd cs| )l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), S ht mi563d | eI nm B. lc oosctmkemp(S.tibhzuerf(efnaScdicIlzdSxeh.msxe[m).N,cC oCgmLmr_.oPbuRupOf(fTgSOri_ozueSps)I[,MN PC CL| LE ^~~~~~~~~~~_ ]P/RNOCTCOL__SSITMEPPLSE/]s/iNzCeCoLf_(STT)E)P S{/ s i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e o f| ( group(groupT )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : group(group677 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 655 : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pri m655s | ( t i d - t i d S t aprrtiBmcsa(stti,d -ntTihdrSetaadrstBRceadsutc,e ,& dniTrherceta-d>soRuetd,u cdei,r encutl-l>ptdro,w n&,d iarregcst-->>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hsoeuntd:,b562u :fa15fr:,g swarning: a-initializer order does not match the declaration order [-Wreorder-ctor]>r gsse-n>drbe uc562fvf | ,b u af rf g,ts -i> dr| (e ^ct vibdu)f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,:f ,202n :t 53h| :r ^ e note: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s(nt h202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr: | 202 e: a53 :d s )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tRiud nI202W | no Br lk oE cl ke (m te nRhtu O(| )p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~., r uA| nl tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)(g woe,) ;563 P | r | o ^t o >s(t)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppe.:pr4Su:in1(z:wee ()note: n;in instantiation of member function 'RunWork, 2, 2>::run' requested herec c| l ^S4 h | mIeMmP.Lc_o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppCm:Om4L.:Lb1_u:FfU fNSnote: Ciin instantiation of member function 'RunWork, 2, 2>::run' requested here(z Aelsl[RN e4Cd | CuILcM_ePP,LR_ OCCTOOOLLL_LNS_EIFTMU_PNDCLI(ERA]El/ClNRTCe,C dLSu_IcSeMT,P ELCPEOS,L /LSNsuEimTz,_e DoiIfnR(tTE8)C_T)t, ) {S I| M ^P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | S:u group(group391m :,95 :i nnote: texpanded from macro 'IMPL_COLL_FUNC'8 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:t666) : 391 9 | | : ^ note: Rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hW:o 391r666k: | 95<: nnote: expanded from macro 'IMPL_COLL_FUNC'c c l F u n c391p# | r# if muRsunn(cWt,o rtkyi#c,#t d-Ne>CvuCrpLe,_d AoLNpGUaal,r ggNosC,-C >LNs_CeACnLLdGb_OP_uR#fO#fTa,Ol g_ao#r,#g psrN-oC>CtrLoe_>cP(vR)bO.urTfuOfn_,(# &# np| cr ^co ltSoh>m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(e):.m202r.:uw53no:(r &knote: n)in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec;c l S\ h 202 m | e| m ^ . w o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h r: k562 ): ;15R :u\ n note: W field 'nthreads' will be initialized after field 'tidInBlock'o| r ^k E l562e | m e /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn :t <562tFi:nd15,(: t Tnote: i,field 'nthreads' will be initialized after field 'tidInBlock'd )R,e dnOt p562h, | r eA al dgs o(t,n itPdhrr(oteitadod)s>,)( ,)n .ttrhiudrnIe(nawBdels)o(;cn k t(| ht ^rh eraeda/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppds:I5)d,:x 1.t:ix d)note: I,in instantiation of member function 'RunWork, 2, 2>::run' requested heren Bglr oo5uc | pkI((gMtrPhoLrue_paC)dO,IL d L| x_ ^~~~~~~~~~~~~~~~~F .UxN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h)C:(562,A: l60lg:Rr eonote: dufield 'group' will be initialized after field 'stepSize'up c(eg,r o 562uC | pO )L, L N E t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^ :562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hinitializer order does not match the declaration order [-Wreorder-ctor]: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: 563note: | field 'group' will be initialized after field 'stepSize' s t562e | p S i z et(indc(ctliSdh)m,e mn.tchormema.dbsu(fnftShirzeeasd[sN)C,C Lt_iPdRIOnTBOl_oScIkM(PtLhEr]e/aNdCICdLx_.SxT)E,P Sg/rsoiuzpe(ogfr(oTu)p)) ,{ | | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkh,r eNaCdCsL(_nAtLhGrOe_a#d#sa)l,g ot,i dNICnCBLl_oPcRkO(TtOh_r#e#apdrIodtxo.>x()),. rgurno(u&pn(cgcrloSuhpm)e,m . w| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r k )| ; tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) \ | ^563 | st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:p562S:i15z:e (note: nfield 'nthreads' will be initialized after field 'tidInBlock'c clShmem .562c | o m m . btuifdf(Stiizde)s,[ NnCtChLr_ePaRdOsT(On_tShIrMePaLdEs])/,N CtCiLd_ISnTBElPoSc/ks(itzheroefa(dTI)d)x .{x ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp (group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626 :| 9 ^~~~~~~~~~~~~~~~~: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60626: | note: field 'group' will be initialized after field 'stepSize' p r562i | m s ( t itdi-dt(itdiSdt)a,r tnSctahtrteeard,s (nnTthhrreeaaddssS)c,a tttiedrI,n BNlUoLcLk,( tdhirreeacdtI-d>xu.px,) ,a rggrso-u>ps(egnrdobuupf)f,, a| r ^~~~~~~~~~~g s->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :5:1: note: nin instantiation of member function 'RunWork, 2, 2>::run' requested hereu llptr, 5a | rIgMsP-L>_sCeOnLdLb_uFfUfN,C (aArlglsR-e>drueccev,b uCfOfL,L NE T| _ ^D IRECT, SIMPLE, Sum, ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t2028:_53t:) note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202391 | : 95 : note: expanded from macro 'IMPL_COLL_FUNC' RunW o391r | k E lReumneWnotrn(c)#.#rduenv(rweed)o;p < t| y ^p e>, NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppL:G4O:_1#:# anote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereg o, NC C4L | _IPMRPOLT_OC_O#L#Lp_rFoUtNoC>((A)l.lrRuend(u&cnec,c lCSOhLmLeNmE.Tw_oDrIkR)E;C T\, S| I ^M PLE, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:m562,: 15i:n tnote: 8field 'nthreads' will be initialized after field 'tidInBlock'_ t) | ^562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthre a391d | s ( nRtuhnrWeoardks<)n,c ctliFduInncB#l#ofcukn(ct,h rteyapdeI,d xF.uxn)c,# #gdreovurpe(dgorpo ,| ^~~~~~~~~~~~~~~~~N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdkI)n;B l\o c k| ( ^t hreadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(gro u562p | ) , | t ^~~~~~~~~~~i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hreads(n:t562hr:e15: warning: initializer order does not match the declaration order [-Wreorder-ctor]a ds), tidInBlock(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~h reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:C562T:,15 :S Iwarning: Minitializer order does not match the declaration order [-Wreorder-ctor]P LE, Sum ,562 | u i n t 8t_itd)( t i| d^) , nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95s:( nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads), 391t | i d IRnuBnlWoocrkk( , N CsCtLe_pASLiGzOe_(#n#caclShmelmg.oc,o mNmC.CbLu_fPfRSOiTzOe_s#[#NpCrCoLt_oP>R(O)T.Or_uSnI(M&PnLcEc]l/SNCCL_hSmTeEmP.Sw/osrikz)e;o f\( T )| ) ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : group(group562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641: 11562: | note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid(tid )641, | n t h r e a d s ( nptrhirmesa(dtsi)d,- ttiiddSItnaBrltoRcekd(utcher,e andTIhdrxe.axd)s,R egdruocuep,( gdrioruepc)t,- > d| o ^~~~~~~~~~~~~~~~~w n, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:r562e:c60t:- >note: ofield 'group' will be initialized after field 'stepSize'u t, arg s562- | > s e n dtbiudf(ft,i da)r,g sn-t>hrreecavdbsu(fnft,h r e| a ^d s), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202I:n53B:l onote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek (th r202e | a d I d x . x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~ T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:r562e:c15t:- >warning: oinitializer order does not match the declaration order [-Wreorder-ctor]u t, args -562> | s e n d btuifdf(,t iadr)g,s -n>trhercevabdusf(fn,t h r| e ^a ds), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l202o:c53k:( tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eadId x202. | x ) , g r o u pR(ugnrWoourpk)E,l e m| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t <| F tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n , T, R563e | d O p , sAtlegpoS,i zPer(ontcoc>l(S)h.mreumn.(cwoem)m;. b u| f ^f Sizes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp_P:R5O:T1O:_ Snote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hereM PLE]/ N5C | CILM_PSLT_ECPOSL/Ls_iFzUeNoCf((ATl)l)R e{d u c| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, C| O group(groupL LNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:C666T:,9 :S Inote: Min instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP LE, S u666m | , u i n t 8 _ tp)r i m| s^( tid, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:T391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'G ather, 391d | i r eRcutn-W>ourpk,< nNcUcLlLF,u nacr#g#sf-u>nsce,n dtbyupfef,, Faurngcs#-#>dreevcrvebduofpf<,t y p| e ^> , NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:L202G:O53_:# #note: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel go, 202N | C C L _ P R O T OR_u#n#WporroktEol>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) :562 :56315 | : warning: initializer order does not match the declaration order [-Wreorder-ctor] stepSize( n562c | c l S h mteimd.(ctoimdm).,b untfhfrSeiazdess([nNtChCrLe_aPdRsO)T,O _tSiIdIMnBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp)()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 641:11 :563 | note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here stepS i641z | e ( n c c l S h m e mp.rciommsm(.tbiudf-ftSiidzSetsa[rNtCRCLe_dPuRcOeT,O _nSTIhMrPeLaEd]s/RNeCdCuLc_eS,T EdPiSr/escitz-e>odfo(wTn),) &{d i r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c t -| > group(groupo ut, args->sendbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:f687,: 11a:r gnote: sin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >rec v687b | u f f , | ^ prims(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202-:t53i:d Stnote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer tBcas t202, | n T h r e a d sRBucnaWsotr,k E&ldeimreenctt<-F>no,u tT,, nRueldlOppt,r ,A lagrog,s -P>rsoetnod>b(u)f.fr,u na(rwges)-;> r e| c ^v buff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :5:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'RunWork, 2, 2>::run' requested here202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here5 | IM P202L | _ C O L L _ F URNuCn(WAolrlkREeldeumceen,t< FCnO,L LTN,E TR_eDdIORpE,C TA,l gSoI,M PPLrEo,t oS>u(m),. ruuinn(tw8e_)t;) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp::3915::951:: note: note: expanded from macro 'IMPL_COLL_FUNC'in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 391I | M P LR_uCnOWLoLr_kF,, u iNnCtC8L__tA)L G O| _^# #al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:o391,: 95N:C Cnote: Lexpanded from macro 'IMPL_COLL_FUNC' _PROT O391_ | # # pRruontWoo>r(k)<.nrcucnl(F&unnccc#l#Sfhumnecm,. wtoyrpke),; F\u n c| # ^# devred/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:p562<:t15y:p enote: >field 'nthreads' will be initialized after field 'tidInBlock', NCCL_A L562G | O _ # # atligdo(,t iNdC)C,L _nPtRhOrTeOa_d#s#(pnrtohtroe>a(d)s.)r,u nt(i&ndcIcnlBSlhomcekm(.twhorreka);d I\d x .| x ^) , gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock') , | ^~~~~~~~~~~~~~~~~562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(60t:i dnote: )field 'group' will be initialized after field 'stepSize', nthre a562d | s ( n t htreiadd(st)i,d )t,i dnItnhBrleoacdks(t(hnrtehardeIdaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e | a ^~~~~~~~~~~~~~~~~d Id/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~562 :15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :warning: 562initializer order does not match the declaration order [-Wreorder-ctor]: 60: note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p(group), 562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(Stihzreesa[dNICdCxL._xP)R,O TgOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T E P| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : pwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]i ms(tid-t i562d | S t a r ttSicda(tttiedr),, nnTthhrreeaaddssS(cnatthtreera,d sN)U,L Lt,i ddIinrBelcotc-k>(utph,r eaardgIsd-x>.sxe)n,d bgurfofu,p (agrrgosu-p>)r,e c v| b ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u f f| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :s202t:e53p:S inote: zin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree (ncc l202S | h m e m . c o m mR.ubnuWfofrSkiEzleesm[eNnCtC/(s)i.zreuonf((wTe))); { | ^| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hin instantiation of member function 'RunWork, 2, 2>::run' requested here: 687:11: 7note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI MPL_C O687L | L _ F U N C ( A l l Rperdiumcse(,t iCdO-LtLiNdESTt_aDrItRBEcCaTs,t ,S InMTPhLrEe,a dSsuBmc,a suti,n t&3d2i_rte)c t -| >^o ut, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:l391l:p95t:r ,note: expanded from macro 'IMPL_COLL_FUNC'a rgs->s e391n | d b uRfufn,W oarrkgcrleFcuvnbcu#f#ff,u n c| , ^ type,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :F202u:n53c:# #note: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree vre d202o | p < t y p e > , RNuCnCWLo_rAkLEGlOe_m#e#natlo(t)o.>r(u)n.(r&unnc(cwleS)h;m e m| . ^w ork);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :\6 : 1: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ^ 6 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L15_:C Onote: Lfield 'nthreads' will be initialized after field 'tidInBlock'L _FUNC (562A | l l R e dtuicde(,t iCdO)L,L NnEtTh_rDeIaRdEsC(Tn,t hSrIeMaPdLsE),, Stuimd,I niBnlto3c2k_(tt)h r e| a^d Idx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC'( group) ,391 | | ^~~~~~~~~~~~~~~~~R unW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:<60n:c cnote: lfield 'group' will be initialized after field 'stepSize'F unc## f562u | n c , ttyipde(,t iFdu)n,c #n#tdherveraeddso(pnd,s )N,C CtLi_dAILnGBOl_o#c#ka(ltghor,e aNdCICdLx_.PxR)O,T Og_r#o#uppr(ogtroo>u(p)).,r u n| ( ^~~~~~~~~~~& ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h7::5621::15 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herewarning: initializer order does not match the declaration order [-Wreorder-ctor] 7 | IMPL _562C | O L L _ FtUiNdC((tAildl)R,e dnutcher,e aCdOsL(LnNtEhTr_eDaIdRsE)C,T ,t iSdIIMnPBLlEo,c kS(utmh,r euaidnItd3x2._xt)), g| r^o up(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95):, note: expanded from macro 'IMPL_COLL_FUNC'| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563R | u n W o rskt_,S INMCPCLLE_]A/LNGCOC_L#_#SaTlEgPoS,/ sNiCzCeLo_fP(RTO)T)O _{# # p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o t o| > group(group( ).run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:c641l:S11h:m enote: min instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. work); 641\ | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562p:r15i:m snote: (field 'nthreads' will be initialized after field 'tidInBlock't id-ti d562S | t a r t Rteiddu(ctei,d )n,T hnrtehardesaRdesd(uncteh,r edaidrse)c,t -t>iddoIwnnB,l o&cdki(rtehcrte-a>doIudtx,. xa)r,g sg-r>osuepn(dgbruofufp,) ,a r g| s ^~~~~~~~~~~~~~~~~- >r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b60u:f fnote: ,field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202t:i53d:( tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered ), n t202h | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBr(o)u.pr)u,n ( w| e ^~~~~~~~~~~) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:n562t:3152:_ twarning: )initializer order does not match the declaration order [-Wreorder-ctor] | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'( tid), 391n | t h rReuandWso(rnkto,u pN)C,C L _| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L G O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #alg o563, | N C C Ls_tPeRpOSTiOz_e#(#npcrcoltSoh>m(e)m..rcuonm(m&.nbcucflfSShimzeems.[wNoCrCkL)_;P R\O T O| _ ^S IMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C15L:_ Snote: Tfield 'nthreads' will be initialized after field 'tidInBlock'E PS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e641a:d11s:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret idInBl o641c | k ( t h r e a d I d xp.rxi)m,s (gtriodu-pt(igdrSotuapr)t,R e d| u ^~~~~~~~~~~~~~~~~c e, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:h562r:e60a:d snote: Rfield 'group' will be initialized after field 'stepSize'e duce, d562i | r e c t -t>iddo(wtni,d )&,d inrtehcrte-a>dosu(tn,t harregasd-s>)s,e ntdibduIfnfB,l oacrkg(st-h>rreeacdvIbduxf.fx,) , | g ^r oup(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^~~~~~~~~~~: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pwarning: )initializer order does not match the declaration order [-Wreorder-ctor], | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 60 : tnote: ifield 'group' will be initialized after field 'stepSize'd (tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup) ,563 | | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| ^ :562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::15391:: 95warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: expanded from macro 'IMPL_COLL_FUNC' 391 | 562R | u n W o rtkie,a dNICdCxL._xA)L,G Og_r#o#uapl(ggor,o uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O T O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #prot o563> | ( ) . r usnt(e&pnSciczleS(hnmcecml.Swhomrekm).;c o\m m .| b ^u ffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:[562N:C15C:L _note: Pfield 'nthreads' will be initialized after field 'tidInBlock'R OTO_S I562M | P L E ] /tNiCdC(Lt_iSdT)E,P Sn/tshirzeeaodfs((Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | t group(groupi dInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:x687.:x11):, note: gin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oup(gr o687u | p ) , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562i:m60s:( tnote: ifield 'group' will be initialized after field 'stepSize'd -tid S562t | a r t B ctaisdt(,t indT)h,r enatdhsrBecaadsst(,n t&hdrieraedcst)-,> otuitd,I nnBullolcpkt(rt,h raeragdsI-d>xs.exn)d,b ugfrfo,u pa(rggrso-u>pr)e,c v b| u ^~~~~~~~~~~f f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, | ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS/siz:e562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho::f15562(::T 15)warning: :)initializer order does not match the declaration order [-Wreorder-ctor] warning: {initializer order does not match the declaration order [-Wreorder-ctor] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562562 | | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h t:ti626id:d(9(:tt iinote: ddin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here)) ,, nntthh rr626ee | aa dd ss (( nn tt hh rr eepaarddissm))s,,( tttiiidd-dIItnniBBdllSootccakkr((tttShhcrraeetaatddeIIrdd,xx ..nxxT))h,,r eggarrdoosuuSppc((aggtrrtooeuurpp,)) ,,N U L| | L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, d | i| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) e ct -563> | u 563p | , as rt gesspt-Se>ipszSeein(zdnebc(ucfnlfc,S chalmrSeghmms.-ec>mor.mecmco.vmbbmuu.ffbffuS,fi fz Se| is ^z[ eNsC[CNLC_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hPL:R_202OP:T53ROO:_T SOnote: I_in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereMS PILME P]202L/ | EN ]C /C NL C _C SL T_ ESRPTuSEn/PWsSoi/rzkseEiolzfee(moTef)n()tT <){F) n ,{| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | R group(group e | d group(groupO p, Algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:r641/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho::t68711o::>11 (:note: ) in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here.note: rin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n( w641e | ) ; 687 | | ^ p r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp i: 6mp:sr1(i:tm inote: sdin instantiation of member function 'RunWork, 2, 2>::run' requested here(- ttiidd -S6tt | iaIdrMStPtRLae_rdCtuOBcLceLa,_s FtnU,TN hCnr(TeAhalrdlesRaRededsdBuucccaees,,t ,Cd Oi&LrdLeiNcrEteT-c_>tDd-Io>RwEonCu,Tt ,,& dSniuIrMlePlcLptEt-,r> ,oS uuatmr,,g sai-rn>gtss3e-2n>_dstbe)un fd fb| ,u^ f afr,g s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha-:rgs->r>391er:ce95vc:bv ubnote: ufexpanded from macro 'IMPL_COLL_FUNC'ff f,, | | 391 ^ ^ | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h<::n53202c::c 53lnote: :Fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here u note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec # #202f | u 202n | c , t y p eR ,u nRFWuuonnrWcko#Er#lkdeEemlveernmetedR,,e dRNOeCpdC,OL p_A,A lLAgGloOg,_o #,P# raPolrtgooot>,o( >)N(.C)rC.uLrn_u(PnwR(eOw)Te;O) _; # | # ^p| r ^o to>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp):.7/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr::u71n::(1 &:note: nin instantiation of member function 'RunWork, 2, 2>::run' requested here c note: cin instantiation of member function 'RunWork, 2, 2>::run' requested herel S h 7m7 | eI | mMI.PMwLoP_rLCk_O)CL;OL L_\LF _U FN| UC ^N( CA(lAl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hlR:le562Rd:eu15dc:ue ,cnote: efield 'nthreads' will be initialized after field 'tidInBlock'C, O LCLON LE562LT | N_ ED TI _R DEtICiRTdE,(C tTSi,Id M)SP,IL MEnP,tL hESr,ue maS,du smu(,in ntuthi3rn2et_a3td2)s_ )t ,)| ^t i| d^I n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l391o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:c:95k391:(: t95note: h:expanded from macro 'IMPL_COLL_FUNC'r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Id x391. | x )391 , | R u gnRrWuoonurWpko( p,ne t>Nh,Cr CeNLa_CdACsLL(G_nOAt_Lh#Gr#Oea_al#dg#soa),l, g NotC,iC dLNI_CnPCBRLlO_oTPcORk_O(#Tt#Ohp_rr#eo#atpdorI>od(tx)o..>xr().ru)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] group(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , nth r563e | a d s ( nsttherpeSaidzse)(,n ctcildSIhnmBelmo.ccko(mtmh.rbeuafdfISdixz.exs)[,N CgCrLo_uPpR(OgTrOo_uSpI)M,P L E| ] ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _STE P563S | / s i z esotfe(pTS)i)z e{( n c| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l S h| m group(groupe m.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hb:u626f:f9S:i znote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres [NCCL_ P626R | O T O _ S I M P LpEr]i/mNsC(CtLi_dS-TtEiPdSS/tsairzteSocfa(tTt)e)r ,{ n T| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups Scatter/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641N:U11L:L ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered irect -641> | u p , a r g s - > spernidmbsu(ftfi,d -atrigdsS-t>arretcRvebduufcfe,, n| T ^h readsRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:u202c:e53,: dnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ect- >202d | o w n , & d i rReucntW-o>rokuEtl,e maerngts<-F>ns,e nTd,b uRfefd,O pa,r gAsl-g>or,e cPvrboutfof>,( ) .| r ^u n(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)r, groupe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~r oup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::39115::95 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: expanded from macro 'IMPL_COLL_FUNC' 391562 | | R u ntWiodr(ktI,d xN.CxC)L,_ AgLrGoOu_p#(#garloguop,) ,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ P R| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T O_##p r563o | t o > ( )s.treupnS(i&znec(cnlcSchlmSehmm.ewmo.rcko)m;m .\b u f| f ^S izes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h[:N562C:C15L:_ Pnote: Rfield 'nthreads' will be initialized after field 'tidInBlock'O TO_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:(641t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI dx.x), 641g | r o u p ( g r o u p )p,r i m| s ^~~~~~~~~~~~~~~~~( ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:-562t:i60d:S tnote: afield 'group' will be initialized after field 'stepSize'r tRedu c562e | , n T htrieda(dtsiRde)d,u cnet,h rdeiardesc(tn-threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:a562l:g15o:, warning: Ninitializer order does not match the declaration order [-Wreorder-ctor]C CL_PROTO_ #562# | p r o t ot>i(d)(.triudn)(,& nnctchlrSehamdesm(.nwtohrrke)a;d s\) , | t ^i dInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k15(:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e adIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n threa d563s | ) , t isdtIenpBSliozcek((ntchcrleSahdmIedmx..cxo)m,m .gbruofufpS(igzreosu[pN)C,C L _| P ^~~~~~~~~~~~~~~~~R OT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I60M:P Lnote: Efield 'group' will be initialized after field 'stepSize'] /NCCL _562S | T E P S /tsiidz(etoifd()T,) )n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626t:i9d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(th r626e | a d I d x . x ) ,p rgirmosu(pt(igdr-otuipd)S,t a r| t ^~~~~~~~~~~S catter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h\ | ^: 562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60563: | note: field 'group' will be initialized after field 'stepSize' step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp_:A7L:G1O:_ #note: #in instantiation of member function 'RunWork, 2, 2>::run' requested herea lgo, N7C | CILM_PPLR_OCTOOL_L#_#FpUrNoCt(oA>l(l)R.erduunc(e&,n cCcOlLSLhNmEeTm_.DwIoRrEkC)T;, \S I M| P ^L E, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:,562 :u15i:n tnote: 3field 'nthreads' will be initialized after field 'tidInBlock'2 _t) | 562^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~~~~~~~ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:A562L:G60O:_ #note: #field 'group' will be initialized after field 'stepSize'a lgo, N562C | C L _ P RtOiTdO(_t#i#dp)r,o tnot>h(r)e.ardusn((n&tnhcrcelaSdhsm)e,m .twiodrIkn)B;l o\c k (| t ^h readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m15.:c owarning: minitializer order does not match the declaration order [-Wreorder-ctor]m .buffSize s562[ | N C C L _tPiRdO(TtOi_dS)I,M PnLtEh]r/eNaCdCsL(_nStThErPeSa/dssi)z,e otfi(dTI)n)B l{o c k| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa dIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:(626g:r9o:u pnote: )in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 626 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | 563 | p r i msst(etpiSdi-ztei(dnSctcalrSthSmceamt.tceorm,m .nbTuhfrfeSaidzseSsc[aNtCtCeLr_,P RNOUTLOL_,S IdMiPrLeEc]t/-N>CuCpL,_ SaTrEgPsS-/>ssieznedobfu(fTf),) a{r g s| - ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~> r e| c group(groupv buff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^: 677:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 677 | 202 | p r iRmusn(Wtoirdk-EtliedmSetnatrr(e)c.tr-u>no(uwte,) ;d i r| e ^c t->do/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppw:n9,: 1a:r gnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here- >sen d9b | uIfMfP,L _aCrOgLsL-_>FrUeNcCv(bAulflfR,e d u| c ^e , COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:N202E:T53_:D Inote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE CT, S202I | M P L E , S u mR,u nuWionrtk6E4l_etm)e n t| <^F n, T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391R:e95d:O pnote: ,expanded from macro 'IMPL_COLL_FUNC' Algo, 391P | r o tRou>n(W)o.rrku, 2, 2>::run' requested hereu nc## d8e | vIrMePdLo_pCU,N CN(CAClLl_RAeLdGuOc_e#,# aClOgLoL,N ENTC_CDLI_RPERCOTT,O _S#I#MpPrLoEt,o >S(u)m.,r uinn(t&6n4c_ctl)S h m| e^m .wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k391):;95 :\ note: expanded from macro 'IMPL_COLL_FUNC'| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h391: | 562 : 15R:u nnote: Wfield 'nthreads' will be initialized after field 'tidInBlock'o rkI,n BNlCoCcLk_(AtLhGrOe_a#d#Iadlxg.ox,) ,N CgCrLo_uPpR(OgTrOo_u#p#)p,r o t| o ^~~~~~~~~~~~~~~~~> ()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:r562u:n60(:& nnote: cfield 'group' will be initialized after field 'stepSize'c lShme m562. | w o r k )t;i d\( t i| d ^) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s (note: nfield 'nthreads' will be initialized after field 'tidInBlock't hread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~k (threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s (nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r60e:a dnote: Ifield 'group' will be initialized after field 'stepSize'd x.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s(nth r563e | a d s ) ,s tteipdSIinzBel(oncckc(ltShhrmeeamd.Icdoxm.mx.)b,u fgfrSoiuzpe(sg[rNoCuCpL)_,P R O| T ^~~~~~~~~~~O _SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tm, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_A L562G | O _ # # atligdo(,t iNdC)C,L _nPtRhOrTeOa_d#s#(pnrtohtroe>a(d)s.)r,u nt(i&dnIcncBllSohcmke(mt.hwroerakd)I;d x\. x )| , ^ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pnote: )field 'nthreads' will be initialized after field 'tidInBlock', | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid (563t | i d ) , snttehprSeiazdes((nnctchlrSehamdesm).,c otmimd.IbnuBflfoScikz(etsh[rNeCaCdLI_dPxR.OxT)O,_ SgIrMoPuLpE(]g/rNoCuCpL)_,S T E| P ^~~~~~~~~~~~~~~~~S /s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:o60f:( Tnote: )field 'group' will be initialized after field 'stepSize') { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 | | group(group tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 687n:t11h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s(nthr e687a | d s ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,B cgarsotu,p (ngTrhoruepa)d,s B c| a ^~~~~~~~~~~s t, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); e| ^ dOp, Algo, Proto>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp(:w9e:)1;: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ^ 9 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppO:L8L:_1F:U Nnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested here( AllR e8d | uIcMeP,L _CCOOLLLLN_EFTU_NCD(IARlElCRTe,d uScIeM,P LCEO,L LSNuEmT,_ DuIiRnEtC6T4,_ tS)I M P| L^E , Sum, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:n391t:6954:_ tnote: )expanded from macro 'IMPL_COLL_FUNC' | ^ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 :R95u:n Wnote: oexpanded from macro 'IMPL_COLL_FUNC'r ku,n cN562#C | C#L d_ Ae Lv GrOte_i#d#doa(lptgLn_,tP hRNOrTCeOC_aL#d_#sAp(rLnoGttOoh_>r#(e#)a.adrlsug)no,(, & tnNciCcdlCISLhn_mBePml.RwoOocTkrO(k_)t#;h# rp\er ao dt| Io ^d> x(.)x.)r,u ng(r&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:nu562c:pc15(:lg Snote: rhfield 'nthreads' will be initialized after field 'tidInBlock'om uepm).,w o r | k562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ) ; | \ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) t i d| ( t ^563i | d ) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hns:tth562er:pe15Sa:di sz(note: enfield 'nthreads' will be initialized after field 'tidInBlock'(t hnrceca dl562sS) | h, m et imd I.tncBilodomc(mkt(.itbhdur)ef,af dSInidtxzh.erxse)[a,N dCgsCr(Lonu_tpP(hRgrrOoeuTpaO)d_,s S) I,| M ^~~~~~~~~~~~~~~~~ P tLiEd]I/n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:B562Cl:Co60L:c _knote: S(field 'group' will be initialized after field 'stepSize'Tt EhPrSe/asdi I562zd | e x o . fxt()iT,d) ()gt ri{do )u ,p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ng trh roe| ua group(groupdps ()n,t h r| e ^~~~~~~~~~~~~~~~~a d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h)687:,:562 11t:i:60d :Inote: nin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: B field 'group' will be initialized after field 'stepSize'l ock (687t | h562 r | e a d I d x . xt )i, dg (r topiurpdi()mg,sr (ontutpih)dr,-e ta id| ds ^~~~~~~~~~~S( tnatrhtrBecaadsst),, ntTihdrIenaBdlsoBccka(stth,r e&addiIrdexc.tx-)>,o ugtr,o unpu(lglrpoturp,) ,a r g| s ^~~~~~~~~~~- >sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 563warning: | initializer order does not match the declaration order [-Wreorder-ctor] step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| group(group | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h563: | 626 : 9 : snote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree pSize( n626c | c l S h m e m . cpormimm.sb(utfifdS-itziedsS[tNaCrCtLS_cPaRtOtTeOr_,S InMTPhLrEe]a/dNsCSCcLa_tStTeErP,S /NsUiLzLe,o fd(iTr)e)c t{- > u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f666f:,9 :a rnote: gin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres ->re c666v | b u f f , | ^p rims(tid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202T:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres Gather ,202 | d i r e c t - > uRpu,n WNoUrLkLE,l eamregnst-<>Fsne,n dTb,u fRfe,d Oapr,g sA-l>groe,c vPbruoftfo,> ( )| . ^r un(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 : 53| : ^ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp | : 9 : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here RunWor k9E | lIeMmPeLn_tCT(_)D.IrRuEnC(Tw,e )S;I M P| L ^E , Sum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 9u:i1n:t 6note: 4in instantiation of member function 'RunWork, 2, 2>::run' requested here_ t) 9| | ^I MPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L391L:_95F:U Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'( AllRedu c391e | , CROuLnLWNoErTk_:, note: Nexpanded from macro 'IMPL_COLL_FUNC'C CL_AL G391O | _ # #RaulngWoo,r kNy(p)e.,r uFnu(n&cn#c#cdleSvhrmeedmo.pw ,\ N C| C ^L _ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:#562a:l15g:o ,note: field 'nthreads' will be initialized after field 'tidInBlock'N CCL_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B loc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock'I dx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's ), ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~I nB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k60(:t hnote: rfield 'group' will be initialized after field 'stepSize'e adId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~d s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkrecvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 4_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hims(t:i562d:-15t:i dwarning: Sinitializer order does not match the declaration order [-Wreorder-ctor]t artReduce, n T562h | r e a d stRiedd(utcied,) ,n unltlhprtera,d s&(dnitrherceta-d>so)u,t ,t iadrIgnsB-l>oscekn(dtbhurfefa,d Iadrxg.sx-)>,r egcrvobuupf(fg,r o u| p ^) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here563 | s202t | e p S i z e ( n cRculnSWhomrekmE.lceommemn.tb](/)N.CrCuLn_(SwTeE)P;S / s| i ^z eof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :{10 : 1| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| group(group 10 | IMPL_COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:F687U:N11C:( Anote: lin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel Reduc e687, | C O L L N E T _ D IpRrEiCmTs,( tSiIdM-PtLiEd,S tSaurmt,B chaasltf,) n T| h^r eadsB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:a391s:t95,: ¬e: dexpanded from macro 'IMPL_COLL_FUNC'i rect-> o391u | t , RnuunlWloprtkr<,n cacrlgFsu-n>cs#e#nfdubnucf,f ,t yapreg,s -F>urnecc#v#bduefvfr,e d o| p ^< type>, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:A53L:G Onote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here# #alg o202, | N C C L _ P R ORTuOn_W#o#rpkrEolteom>e(n)t. ^( ).run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:w562e:)15;: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp | : 10 : 1 :t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here( tid) ,10 | nItMhPrLe_aCdOsL(Ln_tFhUrNeCa(dAsl)l,R etdiudcIen,B lCoOcLkL(NtEhTr_eDaIdRIEdCxT.,x )S,I MgPrLoEu,p (Sgurmo,u ph)a,l f )| ^~~~~~~~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h60::391 :note: 95field 'group' will be initialized after field 'stepSize': note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t i dRunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h563: | 562 : 60 : snote: tfield 'group' will be initialized after field 'stepSize'e pSize (562n | c c l S htmiedm(.tciodm)m,. bnutfhfrSeiazdess([nNtChCrLe_aPdRsO)T,O _tSiIdMIPnLBEl]o/cNkC(CtLh_rSeTaEdPISd/xs.ixz)e,o fg(rTo)u)p ({g r o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p ) ,| group(group | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork(,n tNhCrCeLa_dAsL)G,O _t#i#daIlngBol,o cNkC(CtLh_rPeRaOdTIOd_x#.#xp)r,o tgor>o(u)p.(rgurno(u&pn)c,c l S| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m e m| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)w ork) ;563 | \ | ^s tepSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:(562n:c15c:l Snote: hfield 'nthreads' will be initialized after field 'tidInBlock'm em.com m562. | b u f f Stiizde(st[iNdC)C,L _nPtRhOrTeOa_dSsI(MnPtLhEr]e/aNdCsC)L,_ StTiEdPISn/Bsliozceko(ft(hTr)e)a d{I d x| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x ) ,| group(groupg roup(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:)687,: 11 :| ^~~~~~~~~~~~~~~~~note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60687: | note: field 'group' will be initialized after field 'stepSize' 562p | r i m s (ttiidd(-ttiidd)S, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.work:)562;: 15\: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 562field 'nthreads' will be initialized after field 'tidInBlock' | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~~~~~~~ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60s:t enote: pfield 'group' will be initialized after field 'stepSize'S ize(n c562c | l S h m etmi.dc(otmimd.)b,u fnftShirzeeasd[sN(CnCtLh_rPeRaOdTsO)_,S ItMiPdLIEn]B/lNoCcCkL(_tShTrEePaSd/Isdixz.exo)f,( Tg)r)o u{p ( g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ) group(group, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), n>out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ,warning: initializer order does not match the declaration order [-Wreorder-ctor]S um, uint 65624 | _ t ) t| i^d (tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391n:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd s(nthre a391d | s ) ,R utniWdoIrnkB, N C563C | L _ A L GsOt_e#p#Sailzgeo(,n cNcClCSLh_mPeRmO.TcOo_m#m#.pbruoftfoS>i(z)e.sr[uNnC(C&Ln_cPcRlOSThOm_eSmI.MwPoLrEk])/;N C\C L _| ^S TEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562o:f15(:T )note: )field 'nthreads' will be initialized after field 'tidInBlock' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562| | group(group tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d655):,11 :n tnote: hin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads(n t655h | r e a d s ) , t i dpIrniBmlso(ctki(dt-htriedaSdtIadrxt.Rxe)d,u cger,o unpT(hgrreoaudps)R,e d u| c ^~~~~~~~~~~~~~~~~e , n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:l562l:p60t:r ,note: field 'group' will be initialized after field 'stepSize'& direc t562- | > o u t ,t iadr(gtsi-d>)s,e nndtbhurfefa,d sa(rngtsh-r>eraedcsv)b,u ftfi,d I n| B ^l ock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:I53d:x .note: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , gr o202u | p ( g r o u p ) ,R u n| W ^~~~~~~~~~~o rkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562F:U15N:C (warning: Ainitializer order does not match the declaration order [-Wreorder-ctor]l lReduce, 562C | O L L N EtTi_dD(ItRiEdC)T,, nStIhMrPeLaEd,s (Snutmh,r euaidnst)6,4 _tti)d I n| B^l ock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d Inote: dexpanded from macro 'IMPL_COLL_FUNC'x .x), g391r | o u pR(ugnrWoourpk)<,n c c| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~F u n| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #func ,563 | t y p e ,s tFeupnSci#z#ed(envcrceldSohpm, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum,n doublcec)l F u| n^c ##fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:,391 :t95y:p enote: ,expanded from macro 'IMPL_COLL_FUNC' Func##d e391v | r e dRoupnn,c cNlCFCuLn_cA#L#GfOu_n#c#,a ltgyop,e ,N CFCuLn_cP#R#OdTeOv_r#e#dporpop(e)>.,r uNnC(C&Ln_cAcLlGSOh_m#e#ma.lwgoor,k )N;C C\L _ P| R ^O TO_##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:(562&:n15c:c lnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'h mem.work )562; | \ | t ^i d(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:)562,: 15n:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e ads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)60,: gnote: rfield 'group' will be initialized after field 'stepSize'o up(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~d (t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,60 :n tnote: hfield 'group' will be initialized after field 'stepSize'r eads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~d Idx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N CgCrLo_uApL(GgOr_o#u#pa)l,g o ,| ^~~~~~~~~~~N CCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), oto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:562:15: wwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]r k); \ 562| | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)15,: nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~~~~~~~c lShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:c60o:m mnote: .field 'group' will be initialized after field 'stepSize'b uffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677g:r11o:u pnote: (in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg roup )677, | | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | z^e s[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L391_:P95R:O Tnote: Oexpanded from macro 'IMPL_COLL_FUNC'_ SIMPLE] /391N | C C LR_uSnTWEoPrSk/11,: Nnote: Cin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC L_ALGO_# #677a | l g o , N C C L _ PpRrOiTmOs_(#t#ipdr-ottiod>S(t)a.rrtuBnc(a&sntc,c lnSThhmreema.dwsoBrcka)s;t ,\ & d| i ^r ect->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562t:,15 :d inote: rfield 'nthreads' will be initialized after field 'tidInBlock'e ct->d o562w | n , a rtgisd-(>tsiedn)d,b unftfh,r eaardgss(-n>trhercevabdusf)f,, t i| d ^I nBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. x), g r202o | u p ( g r o u p )R,u n W| o ^~~~~~~~~~~~~~~~~r kEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562e:n60t:< Fnote: nfield 'group' will be initialized after field 'stepSize', T, R e562d | O p , Atligdo(,t iPdr)o,t on>t(h)r.eraudns((wnet)h;r e a| d ^s ), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppB:l11o:c1k:( tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eadI d11x | .IxM)P,L _gCrOoLuLp_(FgUrNoCu(pA)l,l R e| d ^~~~~~~~~~~u ce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor]563 | ste p562S | i z e ( ntcicdl(Sthimde)m,. cnotmhmr.ebaudfsf(Snitzherse[aNdCsC)L,_ PtRiOdTIOn_BSlIoMcPkL(Et]h/rNeCaCdLI_dSxT.ExP)S,/ sgirzoeuopf((gTr)o)u p{) , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | group(group| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 677 : 11s:t enote: pin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereS ize(nc c677l | S h m e m . c o m m .pbruifmfsS(itzieds-[tNiCdCSLt_aPrRtOBTcOa_sStI,M PnLTEh]r/eNaCdCsLB_cSaTsEtP,S /&sdiizreeocft(-T>)o)u t{, d| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e c| t group(group- >down, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:-687>:s11e:n dnote: bin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu ff, arg s687- | > r e c v b u f f , p r| i ^m s(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:S53t:a rnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereB cast ,202 | n T h r e a d s BRcuansWto,r k&Edliermeecntt-<>Fonu,t ,T ,n uRleldpOtpr,, Aalrggos,- >Psreontdob>u(f)f.,r uanr(gwse-)>;r e c| v ^b uff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :| 13 ^: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :13202 | :I53M:P Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC OLL_F U202N | C ( A l l R e d uRcuen,W oCrOkLELlNeEmTe_nDtI().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ , rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hNCCL_:P562R:O15T:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# proto>().run(&n c562c | l S h m etmi.dw(otrikd));, \n t h| r ^e ads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: )field 'nthreads' will be initialized after field 'tidInBlock', tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ^~~~~~~~~~~~~~~~~b uffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:s60[:N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:.626x:)9,: gnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up(gro u626p | ) , | ^~~~~~~~~~~ prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 15g:r owarning: uinitializer order does not match the declaration order [-Wreorder-ctor]p (group), 562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(Stihzreesa[dNICdCxL._xP)R,O TgOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T E P| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/ sizeo f563( | T ) ) {s t e| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S i z| e group(group( ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:m655.:c11o:m mnote: .in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereb uffSi z655e | s [ N C C L _ P R O TpOr_iSmIsM(PtLiEd]-/tNiCdCSLt_aSrTtERPeSd/usciez,e onfT(hTr)e)a d{s R e| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u c e| , group(group nullptr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626&:d9i:r enote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret ->out ,626 | a r g s - > s e npdrbiumfsf(,t iadr-gtsi-d>SrteacrvtbSucfaft,t e r| , ^ nThread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:S202c:a53t:t er, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dStartBcast, nThreadsBcast, &direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562o:u15t:, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]u llptr, a r562g | s - > s etniddb(utfifd,) ,a rngtsh-r>eraedcsv(bnutfhfr,e a d| s ^) , tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. x), g r202o | u p ( g r o u p )R,u n W| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r k E| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ment <563F | n , T ,s tReepdSOipz,e (Anlcgcol,S hPmreomt.oc>o(m)m..rbuunf(fwSei)z;e s [| N ^C CL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppT:O12_:S1I:M Pnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested hereE ]/NC C12L | _ISMTPELP_SC/OsLiLz_eFoUfN(CT()A)l l{R e d| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c e ,| group(groupC OLLNET_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:R666E:C9T:, note: Sin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI MPLE ,666 | S u m , d o u bplrei)m s (| t^i d, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:h391r:e95a:d snote: Gexpanded from macro 'IMPL_COLL_FUNC'a ther, 391d | i r eRcutn-W>ourpk,< nNcUcLlLF,u nacr#g#sf-u>nsce,n dtbyupfef,, Faurngcs#-#>dreevcrvebduofpf<,t y p| e ^> , NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:A202L:G53O:_ #note: #in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea lgo, 202N | C C L _ P R O T OR_u#n#WporroktEol>e(m)e.nrtu().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:(562w:e15):; note: field 'nthreads' will be initialized after field 'tidInBlock'| ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp | : 12 : 1 :t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here( tid) ,12 | nItMhPrLe_aCdOsL(Ln_tFhUrNeCa(dAsl)l,R etdiudcIen,B lCoOcLkL(NtEhTr_eDaIdRIEdCxT.,x )S,I MgPrLoEu,p (Sgurmo,u pd)o,u b l| e ^~~~~~~~~~~~~~~~~) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h60::391 :note: 95field 'group' will be initialized after field 'stepSize': note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t i dR(utniWorkd,I dNxC.CxL)_,A LgGrOo_u#p#(aglrgoou,p )N,C C L| _ ^~~~~~~~~~~P ROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'RunWork, 2, 2>::run' requested here: 562:15: 12warning: | initializer order does not match the declaration order [-Wreorder-ctor]I MPL_COL L562_ | F U N C (tAildl(Rteiddu)c,e ,n tChOrLeLaNdEsT(_nDtIhRrEeCaTd,s )S,I MtPiLdEI,n BSluomc,k (dtohurbelaed)I d x| .^x ), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95(:g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~391 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R unWor k563< | n c c lsFtuenpcS#i#zfeu(nncc,c ltSyhpmee,m .Fcuonmcm#.#bduefvfrSeidzoeps<[tNyCpCeL>_,P RNOCTCOL__SAILMGPOL_E#]#/aNlCgCoL,_ SNTCECPLS_/PsRiOzTeOo_f#(#Tp)r)o t{o > (| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. r u| n group(group( &ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:m641.:w11o:r knote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here; \ | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: pfield 'nthreads' will be initialized after field 'tidInBlock'r ims(ti d562- | t i d S ttairdt(Rteiddu)c,e ,n tnhTrheraedasd(snRtehreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ duce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(All/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:e562d:u15c:e ,warning: initializer order does not match the declaration order [-Wreorder-ctor]C OLLNET_DIR E562C | T , S ItMiPdL(Et,i dS)u,m ,n trhcrcela_dbsf(lnotahtr1e6a)d s )| ,^ tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B391l:o95c:k (note: texpanded from macro 'IMPL_COLL_FUNC'h readI d391x | . x )R,u ngWroorukp<(ngcrcoluFpu)n,c # #| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n c| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) type, 563F | u n c # #sdteevprSeidzoep(h,m eNmC.CcLo_mAmL.GbOu_f#f#Sailzgeos,[ NNCCCCLL__PPRROOTTOO__S#I#MpPrLoEt]o/>N(C)C.Lr_uSnT(E&PnSc/csliSzhemoefm(.Tw)o)r k{) ; | \ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h15::655 :note: 11field 'nthreads' will be initialized after field 'tidInBlock': note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 655t | i d ( t i d ) , n tphrriemasd(st(indt-htriedaSdtsa)r,t RteidduIcneB,l oncTkh(rtehardesaRdeIdduxc.ex,) ,n uglrloputpr(,g r&oduipr)e,c t -| > ^~~~~~~~~~~~~~~~~o ut/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r60g:s -note: >field 'group' will be initialized after field 'stepSize's endb u562f | f , a rtgisd-(>triedc)v,b unftfh,r e a| d ^s (nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Bloc k202( | t h r e a d I d xR.uxn)W,o rgkrEoluepm(egnrto().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ C(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,n tNhCrCeLa_dAsL(GnOt_h#r#eaaldgso),, NtCiCdLI_nPBRlOoTcOk_(#t#hprreoatdoI>d(x)..xr)u,n (g&rnocucpl(Sghrmoeump.)w,o r k| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~; \| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :s15t:e pnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'i ze(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h655::56211::60 :note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: field 'group' will be initialized after field 'stepSize' 655562 | | t i d ( t ipdr)i,m sn(tthirde-atdisd(SnttahrrteRaeddsu)c,e ,t indTIhnrBelaodcskR(etdhurceea,d Induxl.lxp)t,r ,g r&oduipr(egcrto-u>po)u,t , | a ^~~~~~~~~~~r gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hid(tid:)562,: 15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads(nthreads), t i562d | I n B l otcikd((tthirde)a,d Indtxh.rxe)a,d sg(rnotuhpr(egardosu)p,) ,t i d| I ^~~~~~~~~~~n Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:15:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563563 | | sstteeppSSiizzee((nnccccllSShhmmeemm..ccoommmm..bbuuffffSSiizzeess[[NNCCCCLL__PPRROOTTOO__SSIIMMPPLLEE]]//NNCCCCLL__SSTTEEPPSS//ssiizzeeooff((TT)))) {{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h677::64111::11 :note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 641 | p r ipmrsi(mtsi(dt-itdi-dtSitdaSrttaBrctaRsetd,u cneT,h rneTahdrseBacdassRte,d u&cdei,r edcitr-e>cotu-t>,d odwinr,e c&td-i>rdeocwtn-,> oaurtg,s -a>rsgesn-d>bsuefnfd,b uafrfg,s -a>rrgesc-v>bruefcfv,b u f| f ^, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202202: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here Run W202o | r k E l e m e n tR,( )A.lrguon,( wPer)o;t o >| ( ^) .run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppw:e13):;1 : | note: ^in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppP:L13_:C1O:L Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereF UNC(Al l13R | eIdMuPcLe_,C OCLOLL_LFNUENTC_(DAIlRlERCeTd,u cSeI,M PCLOEL,L NSEuTm_,D IrRcEcClT_,b fSlIoMaPtL1E6,) S u| m^, rc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:l391_:b95f:l onote: aexpanded from macro 'IMPL_COLL_FUNC't 16) | 391^ | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:r391k:<95n:c cnote: lexpanded from macro 'IMPL_COLL_FUNC'F unc##fun c391, | t yRpuen,W oFruknt,y pNeC,C LF_uAnLcG#O#_d#e#valrgeod,o pNR,O TNOC_C#L#_pArLoGtOo_>#(#)a.lrguon,( &NnCcCcLl_SPhRmOeTmO._w#o#rpkr)o;t o\> ( )| . ^r un(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S15h:m enote: mfield 'nthreads' will be initialized after field 'tidInBlock'. work )562; | \ | t ^i d(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock'( nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~k (th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60I:d xnote: .field 'group' will be initialized after field 'stepSize'x ), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorks,( nNtChCrLe_aAdLsG)O,_ #t#iadlIgnoB,l oNcCkC(Lt_hPrReOaTdOI_d#x#.pxr)o,t og>r(o)u.pr(ugnr(o&unpc)c,l S h| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m .| w tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rk); \563 | | ^ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:S562i:z15: note: field 'nthreads' will be initialized after field 'tidInBlock' e(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::677562::1160:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herefield 'group' will be initialized after field 'stepSize' 562677 | | t i d ( t i dp)r,i mnst(htrieda-dtsi(dnSttharretaBdcsa)s,t ,t indTIhnrBelaodcskB(ctahsrte,a d&Iddixr.exc)t,- >goruotu,p (dgirroeucpt)-,> d o| w ^~~~~~~~~~~n , args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-tidS:t562a:r15t:B cwarning: ainitializer order does not match the declaration order [-Wreorder-ctor]s t, nThreadsB c562a | s t , &tdiidr(etcitd-)>,o untt,h rdeiardesc(tn-t>hdroewand,s )a,r gtsi-d>IsneBnldobcukf(ft,h raeragdsI-d>xr.exc)v,b ugfrfo,u p (| g ^r oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53 :563 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ste p202S | i z e ( n c c l SRhumneWmo.rckoEmlme.mbeunftfC(C)L._rSuTnE(PwSe/)s;i z e| o ^f (T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 13| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~1 : | note: group(groupin instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:L655L:_11F:U Nnote: Cin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( AllRe d655u | c e , C O L L N E Tp_rDiImRsE(CtTi,d -StIiMdPSLtEa,r tSRuemd,u crec,c ln_Tbhfrleoaadts1R6e)d u c| e^, nu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:l391p:t95r:, note: &expanded from macro 'IMPL_COLL_FUNC'd irect -391> | o u tR,u naWrogrsk-<>nscecnldFbuunfcf#,# faurngcs,- >tryepcev,b uFfufn,c # #| d ^e vredop:, note: Nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC CL_A L202G | O _ # # a l g o ,R uNnCWCoLr_kPERlOeTmOe_n#t#,( )R.erduOnp(,& nAclcgloS,h mPermo.twoo>r(k)).;r u\n ( w| e ^) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :note: 13field 'nthreads' will be initialized after field 'tidInBlock': 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 13 | tIiMdP(Lt_iCdO)L,L _nFtUhNrCe(aAdlsl(Rnetdhurceea,d sC)O,L LtNiEdTI_nDBIlRoEcCkT(,t hSrIeMaPdLIEd,x .Sxu)m,, grrcoculp_(bgfrloouapt)1,6 ) | ^~~~~~~~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::60391:: 95note: :field 'group' will be initialized after field 'stepSize' note: 562expanded from macro 'IMPL_COLL_FUNC' | t i391d | ( t iRdu)n,W onrtkhg,r oNuCpC(Lg_rAoLuGpO)_,# # a| l ^~~~~~~~~~~g o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:)562.:r15u:n (warning: winitializer order does not match the declaration order [-Wreorder-ctor]e ); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 13t:i1d:( tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered ), n t13h | rIeMaPdLs_(CnOtLhLr_eFaUdNsC)(,A ltliRdeIdnuBcleo,c kC(OtLhLrNeEaTd_IDdIxR.ExC)T,, gSrIoMuPpL(Eg,r oSuupm),, r c| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l _ b| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l oat16 )563 | | ^ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:S391i:z95e:( nnote: cexpanded from macro 'IMPL_COLL_FUNC'c lShmem .391c | o m mR.ubnuWfofrSki),) N{C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A L G| O group(group_ ##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:O687_:#11#:p rnote: oin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret o>(). r687u | n ( & n c c l S h m epmr.iwmosr(kt)i;d -\t i d| S ^t artBc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:s562t:,15 :n Tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadsB c562a | s t , &tdiidr(etcitd-)>,o untt,h rneualdlsp(tnrt,h raeragdss-)>,s etniddbIunfBfl,o cakr(gtsh-r>eraedcIvdbxu.fxf),, g| r ^o up(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^~~~~~~~~~~~~~~~~53 : note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 562:60: 202note: | field 'group' will be initialized after field 'stepSize' 562 | R u n W otrikdE(lteimde)n,t l(o)c.kr(utnh(rweea)d;I d x| . ^x ), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp(:g13r:o1u:p )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~ 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | ^ :562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::15391:: 95warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: expanded from macro 'IMPL_COLL_FUNC' 391 | 562R | u n W o rtkie,a dNICdCxL._xA)L,G Og_r#o#uapl(ggor,o uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O T O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #prot o563> | ( ) . r usnt(e&pnSciczleS(hnmcecml.Swhomrekm).;c o\m m .| b ^u ffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:s562[:N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'P ROTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:d666x:.9x:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg roup( g666r | o u p ) , | ^~~~~~~~~~~~~~~~~p rim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562t:i60d:, note: nfield 'group' will be initialized after field 'stepSize'T hread s562G | a t h e rt,i dd(itriedc)t,- >nutph,r eNaUdLsL(,n tahrrgesa-d>ss)e,n dtbiudfIfn,B laorcgks(-t>hrreecavdbIudfxf.,x ) ,| ^g roup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202):,53 : | note: ^~~~~~~~~~~in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:S562i:z15e:s [warning: Ninitializer order does not match the declaration order [-Wreorder-ctor]C CL_PROTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h677r:e11a:d Inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex .x), g r677o | u p ( g r o u p ) , p r| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m s (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d-tid S563t | a r t B csatsetp,S inzTeh(rnecacdlsSBhcmaesmt.,c o&mdmi.rbeucftf-S>iozuets,[ NdCiCrLe_cPtR-O>TdOo_wSnI,M PaLrEg]s/-N>CsCeLn_dSbTuEfPfS,/ sairzgeso-f>(rTe)c)v b{u f f| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 687in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here202 | 687 | R u n W o r k E l epmreinmts<(Ftni,d -Tt,i dRSetdaOrpt,B cAalsgto,, nPTrhorteoa>d(s)B.crausnt(,w e&)d;i r e| c ^t ->out, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppn:u13l:l1p:t rnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here args -13> | sIeMnPdLb_uCfOfL,L _aFrUgNsC-(>ArlelcRvebduufcfe,, C| O ^L LNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:C202T:,53 :S Inote: Min instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP LE, S202u | m , r c c l _ bRfulnoWaotr1k6E)l e m| e^n tk(<)n.crculnF(uwnec)#;# f u| n ^c , type,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :F13u:n1c#:# dnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herev redo p13< | tIyMpPeL>_,C ONLCLC_LF_UANLCG(OA_l#l#Raeldguoc,e NCCL,_ PCROOLTLON_E#T#_pDrIoRtEoC>T(,) .SrIuMnP(L&En,c cSluSmh,m ermc.cwlo_rbkf)l;o a\t 1 6| ) ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h15::391 :note: 95field 'nthreads' will be initialized after field 'tidInBlock': note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t i dR(utniWdo)r,k ),, NgCrCoLu_pA(LgGrOo_u#p#)a,l g o| , ^~~~~~~~~~~~~~~~~ NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R60O:T Onote: _field 'group' will be initialized after field 'stepSize'# #pro t562o | > ( ) . rtuind((&tnicdc)l,S hnmtehmr.ewaodrsk()n;t h\r e a| d ^s ), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( threa d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~n threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hd:)514,: 9n:t hwarning: rvariable 'offset' set but not used [-Wunused-but-set-variable]e ads (514n | t h r e aidnst) ,o ftfisdeItn B=l otcikd(;t h r| e ^a dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_SIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ TEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ S/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cppn:t1h: rIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ha:d10s: ,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h&:r167i: n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:-562>:p15r:e vwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] &ring->ne x562t | , a r gtsi-d>(steindd)b,u fnft,h raeragdss-(>nrtehcrvebaudfsf),, atrigdsI-n>BrleodcOkp(Atrhgr,e a0d,I daxr.gxs)-,> cgornonuIpn(dgerxo,u pa)r,g s -| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c o n| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I ndex )563; | | ^ stepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:z80e:(5n:c cnote: lin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereS hme m80. | c o m m .rbuunfRfiSnigzM(PaLrEg]s/)N;C C L| _ ^S TEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:o53f:( Tnote: )in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~202 | | group(group RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:k34E:l7e:m enote: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret t(h)r.eraudns(,w e&)r;i n g| - ^> prev, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp&:r4i:n1g:- >note: nin instantiation of member function 'RunWork, 1, 2>::run' requested heree xt, a4r | gIsM-P>Ls_eCnOdLbLu_fFfU,N Ca(rRgesd-u>cree,c vRbIuNfGf,, SaIrMgPsL-E>,r ePdrOopdA,r gi,n t08,_ ta)r g s| -^> connI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:d391e:x95,: anote: rexpanded from macro 'IMPL_COLL_FUNC'g s->conn I391n | d e xR)u;n W o| r ^k , ProtoSimple<1, 1>>' requested here, t y80p | e , F urnucn#R#idnegvP,r oNtCoC>L(_aArLgGsO)_;# # a| l ^g o, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:P53R:O Tnote: Oin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here_ ##p r202o | t o > ( ) . r u nR(u&nnWcocrlkSEhlmeemme.nwto().ru n562( | w e ) ; t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp :n4t:h1r: enote: ain instantiation of member function 'RunWork, 1, 2>::run' requested hered s(n t4h | rIeMaPdLs_)C,O LtLi_dFIUnNBCl(oRcekd(utcher,e aRdIINdGx,. xS)I,M PgLrEo,u pP(rgordo,u pi)n,t 8 _| t ^~~~~~~~~~~~~~~~~) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h60::391 :note: 95field 'group' will be initialized after field 'stepSize': note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t iRdu(ntWiodr)k,< nnctchlrFeuandcs#(#nftuhnrce,a dtsy)p,e ,t iFduInncB#l#odcekv(rtehdroepax,) ,N CgCrLo_uApL(GgOr_o#u#pa)l,g o ,| ^~~~~~~~~~~N CCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]x .x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~~~~~~~) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s (note: nfield 'group' will be initialized after field 'stepSize't hread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ( t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadId x563. | x ) , sgtreopuSpi(zger(onucpc)l,S h m| e ^~~~~~~~~~~m .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhr:e562a:d15s:( nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t idInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.60x:) ,note: field 'group' will be initialized after field 'stepSize'g roup(gro u562p | ) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d), n t563h | r e a d ss(tnetphSriezaed(sn)c,c ltSihdmIenmB.lcoocmkm(.tbhurfefaSdiIzdexs.[xN)C,C Lg_rPoRuOpT(Og_rSoIuMpP)L,E ] /| N ^~~~~~~~~~~C CL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p(gr o563u | p ) , s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e p S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)z e(nccl S563h | m e m . csotmemp.Sbiuzfef(SniczcelsS[hNmCeCmL._cPoRmOmT.Ob_uSfIfMSPiLzEe]s/[NNCCCCLL__SPTREOPTSO/_sSiIzMePoLfE(]T/)N)C C{L _ S| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E P S| / group(groups izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h34::347::7 :note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 3434 | | pprriimmss((ttiidd,, nntthhrreeaaddss,, &&rriinngg-->>pprreevv,, &&rriinngg-->>nneexxtt,, aarrggss-->>sseennddbbuuffff,, aarrggss-->>rreeccvvbbuuffff,, aarrggss-->>rreeddOOppAArrgg,, 00,, aarrggss-->>ccoonnnnIInnddeexx,, aarrggss-->>ccoonnnnIInnddeexx));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h::8080::55:: note: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herein instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | 80 | r urnuRniRnigno(>a(ragrsg)s;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202::20253::53 :note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 202 | R u nRWuonrWkoErlkeEmleenmte().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uintIn file included from 6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp4:_1t: *In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:t10r: In file included from =/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :r169e: c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hv:P271t:r19(:0 )warning: +unused variable 'ptr' [-Wunused-variable]l l128Off s271e | t ; | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hinitializer order does not match the declaration order [-Wreorder-ctor]: 916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 916 | t i d (ptriidm)s,( gnrtohurpeTaidds,( ngtrhoruepaNdtsh)r,e atdisd,I n&Brleoccvk,( t&hsreenadd,I daxr.gxs)-,> sgernodubpu(fgfr,o uapr)g,s - >| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e c v| b tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u ff, | ^563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:t202e:p53S:i znote: ein instantiation of member function 'RunWorkElement, 3, 2>::run' requested here( ncc l202S | h m e m . c o m mR.ubnuWfofrSkiEzleesm[eNnCtC/(s)i.zreuonf((wTe))); { | ^| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp group(group: 10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 91610: | 7I:M Pnote: Lin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here_ COLL_F U916N | C ( A l l R epdruicmes,( gCrOoLuLpNTEiTd_,C HgArIoNu,p NStIhMrPeLaEd,s ,M i&nr,e chva,l f&)s e n| d^, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:-391>:s95e:n dnote: bexpanded from macro 'IMPL_COLL_FUNC'u ff, a r391g | s - >RruencWvobrukf, 3, 2>::run' requested here, Fun c202# | # d e v r e d o pRk,E lNeCmCeLn_tAr(o)t.or>u(n)(.wreu)n;( & n| c ^c lShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cppw:o11r:k1):; note: \in instantiation of member function 'RunWork, 3, 2>::run' requested here | ^ 11 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_15C:O Lnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ FUNC( A562l | l R e d utcied,( tCiOdL)L,N EnTt_hCrHeAaIdNs,( nStIhMrPeLaEd,s )M,i nt,i dfIlnoBalto)c k (| t^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup(g r391o | u p )R,u n W| o ^~~~~~~~~~~~~~~~~r kt,h rNeCaCdLs_)A,L GtOi_d#I#naBllgooc,k (NtChCrLe_aPdRIOdTxO._x#)#,p rgortoou>p(()g.rrouunp()&,n c c| l ^~~~~~~~~~~S hmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work);In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp\: 1 : | In file included from ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h::562169:: 15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:: 271note: :field 'nthreads' will be initialized after field 'tidInBlock'19 : warning: unused variable 'ptr' [-Wunused-variable] 562 | tid( t271i | d ) , n t h r euaidnst(6n4t_htr*e apdtsr) ,= triedcIvnPBtlro(c0k)(+tlhlr1e2a8dOIfdfxs.ext);, g| r ^~~o up(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:s15[:N Cwarning: CLinitializer order does not match the declaration order [-Wreorder-ctor]_ PROTO_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hh:r34e:a7d:I dnote: xin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. x), gro u34p | ( g r o u p )p,r i m| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), nthr e563a | d s , &srtienpgS-i>zper(envc,c l&Srhimnegm-.>cnoemxmt.,b uafrfgSsi-z>esse[nNdCbCuLf_fP,R OaTrOg_sS-I>MrPeLcEv]b/uNfCfC,L _aSrTgEsP-S>/rseidzOepoAfr(gT,) )0 ,{ a r| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s - >| c group(groupo nnIndex/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h,: 34a:r7g:s -note: >in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec onnInde x34) | ; | ^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hs:(80t:i5d:, note: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heret hre a80d | s , & rriunngR-i>npgrPnreoxtto,> (aarrggss-)>;s e n| d ^b uff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:s202-:>53recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C15C:L _warning: Ainitializer order does not match the declaration order [-Wreorder-ctor]L GO_##algo ,562 | N C C L _tPiRdO(TtOi_d#)#,p rnotthor>e(a)d.sr(unnt(h&rnecacdlsS)h,m etmi.dwIonrBkl)o;c k\( t h| r ^e adIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock'( group), 562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d(ti d563) | , n t hreads(nthreads), tidsItneBplSoiczke((tnhcrcelaSdhImdexm..xc)o,m mg.rbouufpf(Sgirzoeusp[)N,C C L| _ ^~~~~~~~~~~~~~~~~P ROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P60L:E ]note: /field 'group' will be initialized after field 'stepSize'N CCL_STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hs:)34,: 7t:i dnote: Iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Block (34t | h r e a d I dpxr.ixm)s,( tgirdo,u pn(tghrroeuapd)s,, &| r ^~~~~~~~~~~i ng->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :warning: 202initializer order does not match the declaration order [-Wreorder-ctor]: 53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 562202 | | t i d ( tRiudn)W,o rnktEhlreemaednst(a(d)I.drxu.nx()w,e )g;r o u| p ^( grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cppp:)4,: 1 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: in instantiation of member function 'RunWork, 1, 2>::run' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 4 | IMPL _563C | O L L _ FsUtNeCp(SRiezdeu(cnec,c lRSIhNmGe,m .ScIoMmPmL.Eb,u fSfuSmi,z eisn[tN8C_CtL)_ P R| O^T O_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P391L:E95]:/ Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'C L_STEP S391/ | s i zReuonfW(oTr)k)< n{c c l| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u n c| # group(group# func, ty/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hp:e34,: 7F:u nnote: cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #devredo p34< | t y p e > , pNrCiCmLs_(AtLiGdO,_ #n#tahlrgeoa,d sN,C C&Lr_iPnRgO-T>Op_r#e#vp,r o&troi>n(g)-.>rnuenx(t&,n cacrlgSsh-m>esme.nwdobrukf)f;, \a r g| s ^- >recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f15,: anote: rfield 'nthreads' will be initialized after field 'tidInBlock'g s->re d562O | p A r g ,t i0d,( tairdg)s,- >nctohnrneIanddse(xn,t harregasd-s>)c,o ntniIdnIdneBxl)o;c k (| t ^h readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:x80.:x5):, note: gin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer oup (80g | r o u p )r,u n R| i ^~~~~~~~~~~~~~~~~n g ( a r gtsi)d;( t i| d ^) , nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:(53n:t hnote: rin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heree ads), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~o , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:(562&:n15c:c lwarning: Sinitializer order does not match the declaration order [-Wreorder-ctor]h mem.work )562; | \ | t ^i d(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d x.x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~~~~~~~c lShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:c60o:m mnote: .field 'group' will be initialized after field 'stepSize'b uffSi z562e | s [ N C CtLi_dP(RtOiTdO)_,S InMtPhLrEe]a/dNsC(CnLt_hSrTeEaPdSs/)s,i zteiodfI(nTB)l)o c{k ( t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groupI dx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:o34u:p7(:g rnote: oin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu p), | ^~~~~~~~~~~ 34 | prims(tid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s , &ring- >562p | r e v , t&irdi(ntgi-d>)n,e xntt,h raeragdss-(>nstehnrdebaudfsf),, atrigdsI-n>Brleoccvkb(utfhfr,e aadrIgdsx-.>xr)e,d OgprAorugp,( g0r,o uapr)g,s - >| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o n n| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n dex, a r563g | s - > c osntneIpnSdiezxe)(;n c c| l ^S hmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h.:c80o:m5m:. bnote: uin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested heref fSi z80e | s [ N C CrLu_nPRRiOnTgO<_TS,I MRPeLdEO]p/,N CPCrLo_tSoT>E(PaSr/gssi)z;e o f| ( ^T )) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 53 :| group(groupnote: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h | : 34 : 7 : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWork E34l | e m e n t < Fpnr,i mTs,( tRiedd,O pn,t hArlegaod,s ,P r&ortion>g(-)>.prruenv(,w e&)r;i n g| - ^> next,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp :a10r:g1s:- >note: sin instantiation of member function 'RunWork, 1, 2>::run' requested heree ndb u10f | fI,M PaLr_gCsO-L>Lr_eFcUvNbCu(fRfe,d uacreg,s -R>IrNeGd,O pSAIrMgP,L E0,, Saurmg,s -h>aclofn)n I n| d^e x, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g391s:-95>:c onote: nexpanded from macro 'IMPL_COLL_FUNC'n Index) ;391 | | ^R unWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h<:n80c:c5l:F unote: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herec ##f u80n | c , t yrpuen,R iFnugn>(,a rNgCsC)L;_ A L| G ^O _##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202N:C53C:L _note: Pin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereR OTO_ #202# | p r o t o > ( ) .RruunnW(o&rnkcEclleSmhemnetm<.Fwno,r kT),; R\e d O| p ^, Alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:,562 :P15r:o tnote: ofield 'nthreads' will be initialized after field 'tidInBlock'> ().ru n562( | w e ) ; t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp :n5t:h1r:e anote: din instantiation of member function 'RunWork, 1, 2>::run' requested heres (nt h5r | eIaMdPsL)_,C OtLiLd_IFnUBNlCo(cRke(dtuhcree,a dRIIdNxG.,x )S,I MgPrLoEu,p (Sgurmo,u pu)i,n t 8| _ ^~~~~~~~~~~~~~~~~t ) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hfield 'group' will be initialized after field 'stepSize': 391:95: note: 562expanded from macro 'IMPL_COLL_FUNC' | tid( t391i | d ) ,R unntWhorrekau,p (NgCrCoLu_pA)L,G O _| # ^~~~~~~~~~~# algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t i dR(utniWdo)r,k ),, NgCrCoLu_pA(LgGrOo_u#p#)a,l g o| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _PRO T563O | _ # # p rsotteop>S(i)z.er(unnc(c&lnSchcmleSmh.mceomm.mw.obrukf)f;S i\z e s| [ ^N CCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:O562T:O15_:S Inote: Mfield 'nthreads' will be initialized after field 'tidInBlock'P LE]/N C562C | L _ S T EtPiSd/(stiizde)o,f (nTt)h)r e{a d s| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:I34n:B7l:o cnote: kin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( threadI d34x | . x ) , g rporuipm(sg(rtoiudp,) ,n t h| r ^~~~~~~~~~~~~~~~~e ads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562&:r60i:n gnote: -field 'group' will be initialized after field 'stepSize'> prev, 562& | r i n g -t>inde(xtti,d )a,r gnst-h>rseeanddsb(unftfh,r eaardgss)-,> rteicdvIbnuBflfo,c ka(rtghsr-e>ardeIddOxp.Axr)g,, g0r,o uapr(ggsr-o>ucpo)n,n I n| d ^~~~~~~~~~~e x, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 15 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | s562t | e p S i ztei(dn(ctcildS)h,m enmt.hcroemamd.sb(unftfhSriezaedss[)N,C CtLi_dPIRnOBTlOo_cSkI(MtPhLrEe]a/dNICdCxL._xS)T,E PgSr/osuipz(egorfo(uTp))), { | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| group(group 563 | stepSize(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:m916e:m7.:c onote: min instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested herem .buff S916i | z e s [ N C CpLr_iPmRsO(TgOr_oSuIpMTPiLdE,] /gNrCoCuLp_NStThErPeSa/dssi,z e&orfe(cTv),) &{s e n| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f916f:,7 :a rnote: gin instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested heres ->r e916c | v b u f f , p r| i ^m s(groupTi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:,202 :g53r:o unote: pin instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereN thre a202d | s , & r e c v ,R u&nsWeonrdk,E laermgesn-t>lrgeoc,v bPurfoft,o > (| ) ^. run(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 : 53| : ^ note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp :2024 | : 1 : note: in instantiation of member function 'RunWork, 3, 2>::run' requested here Run W4o | rIkMEPlLe_mCeOnLtL<_FFnU,N CT(,A lRleRdeOdpu,c eA,l gCoO,L LPNrEoTt_oC>H(A)I.Nr,u nS(IwMeP)L;E , | M ^a x, int8_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cppt:)6 : 1| :^ note: in instantiation of member function 'RunWork, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391: 956: | Inote: Mexpanded from macro 'IMPL_COLL_FUNC'P L_COL L391_ | F U NRCu(nAWlolrRke , | N^C CL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:G391O:_95#:# anote: lexpanded from macro 'IMPL_COLL_FUNC'g o, NCC L391_ | P R ORTuOn_W#o#rpkrlFunc#(#)f.urnucn,( &tnycpcel,S hFmuenmc.#w#odrekv)r;e d\o p <| t ^y pe>, NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hG:O562_:#15#:a lnote: gfield 'nthreads' will be initialized after field 'tidInBlock'o , NCC L562_ | P R O T Ot_i#d#(ptriodt)o,> (n)t.hrruena(d&sn(cnctlhSrhemaedms.)w,o rtki)d;I n\B l o| c ^k (threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' group( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~~~~~~~t id),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x60.:x )note: ,field 'group' will be initialized after field 'stepSize' group (562g | r o u p )t,i d (| t ^~~~~~~~~~~i d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l o c| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( threa d563I | d x . x )s,t egprSoiuzpe((gnrcoculpS)h,m e m| . ^~~~~~~~~~~c omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sen17 warnings generated when compiling for gfx803. dbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp::3861:: 9In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :warning: 10variable 'wireOffset' set but not used [-Wunused-but-set-variable]: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169 : 386/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h | : 271 : 19 :i nwarning: tunused variable 'ptr' [-Wunused-variable] wireOff s271e | t = W i r e WuoirndtP6e4r_Stl*i cpet*rw a=r pr e+c v2P*twri(d0;) + l| l ^1 28Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562(:A15l:l Rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]d uce, COLL N562E | T _ C H AtIiNd,( tSiIdM)P,L En,t hPrreoadd,s (hnatlhfr)e a d| s^) , tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B391l:o95c:k (note: texpanded from macro 'IMPL_COLL_FUNC'h readId x391. | x ) ,R ugnrWoourpk(m,e mN.CcCoLm_mA.LbGuOf_f#S#iazlegso[,N CNCCLC_LP_RPORTOOT_OS_I#M#PpLrEo]t/oN>C(C)L._rSuTnE(P&Sn/cscilzSehomfe(mT.)w)o r{k ) ;| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~\ | | group(group ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::916562::715:: note: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 916 | 562 | tpirdi(mtsi(dg)r,o unptThirde,a dgsr(onutphNrtehardesa)d,s ,t i&drIencBvl,o c&ks(etnhdr,e aadrIgdsx-.>xs)e,n dgbruofufp,( garrogusp-)>,r e c| v ^~~~~~~~~~~~~~~~~b uff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :| 60 ^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56253 | : note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here tid( t202i | d ) , n t h r eRaudnsW(onrtkhErleeamdesn)t,< Ftni,d ITn,B lRoecdkO(pt,h rAelagdoI,d xP.rxo)t,o >g(r)o.urpu(ng(rwoeu)p;) , | ^| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 563: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]s tepSize( n562c | c l S h mteimd.(ctoimdm).,b unftfhSriezaedss[(NnCtChLr_ePaRdOsT)O,_ StIiMdPILnEB]l/oNcCkC(Lt_hSrTeEaPdSI/dsxi.zxe)o,f (gTr)o)u p{( g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p )| , group(group | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655: 11563: | note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here step S655i | z e ( n c c l S h m epmr.icmosm(mt.ibdu-ftfiSdiSzteasr[tNRCeCdLu_cPeR,O TnOT_hSrIeMaPdLsER]e/dNuCcCeL,_ SnTuElPlSp/tsri,z e&odfi(rTe)c)t -{> o u| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f655f:,11 :a rnote: gin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres ->recv b655u | f f , | ^ prims(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:-53t:i dnote: Sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret artRe d202u | c e , n T h r eRaudnsWRoerdkuEclee,m ennutlAolugto,, aPrrgost-o>>s(e)n.drbuunf(fw,e )a;r g s| - ^> recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppf:,4 : 1| : ^ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h4: | 202I:M53P:L _note: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereO LL_F U202N | C ( A l l R e d uRcuen,W oCrOkLELlNeEmTe_nDtI ( )| .^r un(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:)391;: 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp: 4391: | 1 : Rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren Work <4n | cIcMlPFLu_nCcO#L#Lf_uFnUcN,C (tAylpleR,e dFuucnec,# #CdOeLvLrNeEdTo_pD, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement ().run(we); 562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppi:d4):,1 :n tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eads (4n | tIhMrPeLa_dCsO)L,L _tFiUdNICn(BAllolcRke(dtuhcree,a dCIOdLxL.NxE)T,_ DgIrRoEuCpT(,g rSoIuMpP)L,E , | M ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i n ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i nt8_t )563 | | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i391z:e95(:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Shmem.c o391m | m . bRuufnfWSoirzke ,{ N C| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L _ A| L group(groupG O_##algo, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:O666T:O9_:# #note: pin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oto>( )666. | r u n ( & n c c lpSrhimmesm(.twiodr,k )n;T h\r e a| d ^s Gathe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:,562 :d15i:r enote: cfield 'nthreads' will be initialized after field 'tidInBlock't ->up, 562N | U L L , tairdg(st-i>ds)e,n dnbtuhfrfe,a dasr(gnst-h>rreeacdvsb)u,f ft,i d I| n ^B lock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I202d:x53.:x )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here grou p202( | g r o u p ) , R| u ^~~~~~~~~~~~~~~~~n Work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:l562e:m60e:n tnote: r(e)a.drsu(nn(twher)e;a d s| ) ^, tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppo:c4k:(1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea dIdx .4x | )I,M PgLr_oCuOpL(Lg_rFoUuNpC)(,A l l| R ^~~~~~~~~~~e duce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBloIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppk:(1t: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhIn file included from :r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562e::a1015d: :IIn file included from dwarning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hxinitializer order does not match the declaration order [-Wreorder-ctor]:. 167x: )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g r o562u | p ) ,562 | t| i ^~~~~~~~~~~~~~~~~ d (tt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hii:dd562():t,60 i:nd t)note: h,field 'group' will be initialized after field 'stepSize'r enatd hs562r( | en at dh sr (etnaitdhd(rste)i,ad d)ts,i) d,nI ttnhiBrdleIoandBcskl((ontcthkhr(reteaharddeIsad)dx,I. dxtx)i.,dx I)gn,rB olguorpco(kug(prt(ohgurrpeo)au,dp I) d,| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o u p563( | g 563r | o u ps )t ,es pt Se| ip ^~~~~~~~~~~zS ei(znec(cnlcSchlmSehmm.ecmo.mcmo.mbmu.fbfuSfifzSeisz[eNsC[CNLC_CPLR_OPTROO_TSOI_MSPILMEP]L/EN]C/CNLC_CSLT_ESPTSE/PsSi/zseiozfe(oTf)()T ){) {| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 626note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here9 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | 626 | pprriimmss((ttiidd--ttiiddSSttaarrttSBccaatstte,r ,n TnhTrheraedasdBscSacsatt,t er, NU&LdLi,r edcitr-e>cotu-t>,u pn,u lalrpgtsr-,> saerngdsb-u>fsfe,n dabrugfsf-,> raercgvsb-u>frfe,c v b| u ^f f, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 202:53: 202note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R u n W o r k ERluenmWeonrtk,( )P.rroutno(>w(e)).;r u n| ( ^w e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :note: 4in instantiation of member function 'RunWork, 2, 2>::run' requested here: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here4 | IMP L4_ | CIOMLPLL__FCUONLCL(_AFlUlNRCe(dAulcleR,e dCuOcLeL,N ECTO_LDLINREETC_TD,I RSEICMTP,L ES,I MMPiLnE,, iMnitn8,_ ti)n t 8| _^t ) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h^: 391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391expanded from macro 'IMPL_COLL_FUNC': 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R391u | n W oRruknt,y pNeC>C,L _NACLCGLO__A#L#GaOl_g#o#,a lNgCoC,L _NPCRCOLT_OP_R#O#TpOr_o#t#op>r(o)t.or>u(n)(.&rnucnc(l&SnhcmcelmS.hwmoermk.)w;o r\k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15::562 :note: 15field 'nthreads' will be initialized after field 'tidInBlock': note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56260::60 :note: field 'group' will be initialized after field 'stepSize'note: field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:c15o:m mwarning: .initializer order does not match the declaration order [-Wreorder-ctor]b uffSizes[NCC L562_ | P R O T Ot_iSdI(MtPiLdE)],/ NnCtChLr_eSaTdEsP(Sn/tshirzeeaodfs()T,) )t i{d I n| B ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l o c| k group(group( threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626g:r9o:u pnote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg roup) ,626 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) prims (563t | i d - t isdtSetpaSritzSec(antctcelrS,h mneTmh.rceoamdms.SbcuaftftSeirz,e sN[UNLCLC,L_ PdRiOrTeOc_tS-I>MuPpL,E ]a/rNgCsC-L>_sSeTnEdPbSu/fsf,i zaerogfs(-T>)r)e c{v b u| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f , | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h53::655 :note: 11in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 202 | 655 | R u n W o r k E l epmreinmts<(Ftni,d -Tt,i dRSetdaOrpt,R eAdlugcoe,, PnrTohtroe>a(d)s.Rreudnu(cwee,) ;n u l| l ^p tr, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppd:i4r:e1c:t -note: >in instantiation of member function 'RunWork, 2, 2>::run' requested hereo ut, 4a | rIgMsP-L>_sCeOnLdLb_uFfUfN,C (aArlglsR-e>drueccev,b uCfOfL,L N E| T ^_ DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I202M:P53L:E ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereM in, i n202t | 8 _ t ) | ^ Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hW:o391r:k95E:l enote: mexpanded from macro 'IMPL_COLL_FUNC'e ntf(u)n.cr,u nt(ywpee),; F u| n ^c ##devr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppe:d4o:p1<:t ynote: pin instantiation of member function 'RunWork, 2, 2>::run' requested heree >, N C4C | LI_MAPLLG_OC_O#L#La_lFgUoN,C (NAClClLR_ePdRuOcTeO,_ #C#OpLrLoNtEoT>_(D)I.RrEuCnT(,& nScIcMlPSLhEm,e mM.iwno,r ki)n;t 8\_ t )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::391562::9515:: note: note: expanded from macro 'IMPL_COLL_FUNC'field 'nthreads' will be initialized after field 'tidInBlock' 562 | 391 | RtuindW(otrikd<)n,c cnltFhurneca#d#sf(unntch,r etaydpse, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:C562O:L15L:_ Fwarning: Uinitializer order does not match the declaration order [-Wreorder-ctor]N C(AllRedu c562e | , C O LtLiNdE(Tt_iDdI)R,E CnTt,h rSeIaMdPsL(En,t hMriena,d si)n,t 8t_itd)I n B| l^o ck(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:I dnote: xexpanded from macro 'IMPL_COLL_FUNC'. x), gr o391u | p ( gRruonuWpo)r,k < n| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c l F| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n c##f u563n | c , t ysptee,p SFiuznec(#n#cdcelvSrhemdeomp.b,u fNfCSCiLz_eAsL[GNOC_C#L#_aPlRgOoT,O _NSCICMLP_LPER]O/TNOC_C#L#_pSrToEtPoS>/(s)i.zreuonf((&Tn)c)c l{S h m| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m . w| o group(groupr k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h11::562 :note: 15in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: field 'nthreads' will be initialized after field 'tidInBlock' 641 | 562 | t i d (ptriidm)s,( tnitdh-rteiaddSst(anrtthRreeduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 4562 | :I15M:P Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]C OLL_FUN C562( | A l l R etdiudc(et,i dC)O,L LnNtEhTr_eDaIdRsE(CnTt,h rSeIaMdPsL)E,, tMiidnI,n Bilnotc8k_(tt)h r e| a^d Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391g:r95o:u pnote: (expanded from macro 'IMPL_COLL_FUNC'g roup), | 391 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u nWor k563< | n c c l Fsutnecp#S#ifzuen(cn,c ctlySphem,e mF.ucnocm#m#.dbeuvfrfeSdiozpeL,_ PNRCOCTLO__ASLIGMOP_L#E#]a/lNgCoC,L _NSCTCELP_SP/RsOiTzOe_o#f#(pTr)o)t o{> ( )| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r u n| ( group(group& ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:m655.:w11o:r knote: )in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here; \ | ^655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15p:r inote: mfield 'nthreads' will be initialized after field 'tidInBlock's (tid-ti d562S | t a r t Rteiddu(ctei,d )n,T hnrtehardesaRdesd(uncteh,r enaudlsl)p,t rt,i d&IdniBrleocctk-(>tohurte,a daIrdgxs.-x>)s,e ngdrbouufpf(,g raorugps)-,> r e| c ^~~~~~~~~~~~~~~~~v buff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^60 : note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | 202t | i d ( t i d ) , RnutnhWroerakdEsl(enmtehnrte.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_A L562G | O _ # # atligdo(,t iNdC)C,L _nPtRhOrTeOa_d#s#(pnrtohtroe>a(d)s.)r,u nt(i&dnIcncBllSohcmke(mt.hwroerakd)I;d x\. x )| , ^ group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d (sttiedp)S,i znet(hnrcecaldSsh(mnetmh.rceoamdms.)b,u ftfiSdiIzneBsl[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ]g/rNoCuCpL(_gSrToEuPpS)/,s i z| e ^~~~~~~~~~~~~~~~~o f(T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):)562 :{60 : | note: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~field 'group' will be initialized after field 'stepSize' | group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t687i:d11(:t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , nth r687e | a d s ( n t h r e a dpsr)i,m st(itdiIdn-BtliodcSkt(atrhtrBecaadsItd,x .nxT)h,r egardosuBpc(agsrto,u p&)d,i r e| c ^~~~~~~~~~~t ->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]) , nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c l S| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m em.comm .563b | u f f S iszteesp[SNiCzCeL(_nPcRcOlTSOh_mSeImM.PcLoEm]m/.NbCuCfLfS_iSzTeEsP[SN/CsCiLz_ePoRfO(TTO)_)S I{M P L| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~] / N| C group(groupC L_STEPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:o677f:(11T:) )note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 677| | group(group pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:s677(:t11i:d -note: tin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei dStartB c677a | s t , n T h r e a dpsrBicmass(tt,i d&-dtiirdeSctta-r>toBucta,s td,i rneTchtr-e>addoswBnc,a satr,g s&-d>isreencdtb-u>fofu,t ,a rdgisr-e>crte-c>vdbouwfnf,, a r| g ^s ->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:b202u:f53f:, note: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer gs- >202r | e c v b u f f , R u| n ^W orkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:<202F:n53,: Tnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here RedOp, 202A | l g o , P r o tRou>n(W)o.rrkuEnl(ewmee)n;t < F| n ^, T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppe:d4O:p1,: Anote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereg o, P4r | oItMoP>L(_)C.OrLuLn_(FwUeN)C;( A l| l ^R educe, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppL:N4E:T1_:D Inote: Rin instantiation of member function 'RunWork, 2, 2>::run' requested hereE CT, S I4M | PILMEP,L _MCiOnL,L _iFnUtN8C_(tA)l l R| e^d uce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391C:O95L:L Nnote: Eexpanded from macro 'IMPL_COLL_FUNC'T _DIRE C391T | , SRIuMnPWLoEr,k ,R uNnCWCoLr_kAv(r)e.droupn<(t&ynpcec>l,S hNmCeCmL._wAoLrGkO)_;# #\a l g| o ^, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R15O:T Onote: _field 'nthreads' will be initialized after field 'tidInBlock'# #pr o562t | o > ( ) .triudn((t&indc)c,l Snhtmherme.awdosr(kn)t;h r\e a d| s ^) , tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (note: tfield 'nthreads' will be initialized after field 'tidInBlock'h readIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~a ds/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's ), t i562d | I n B l otcikd((tthirde)a,d Indtxh.rxe)a,d sg(rnotuhpr(egardosu)p,) ,t i d| I ^~~~~~~~~~~~~~~~~n Block/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r60e:a dnote: Ifield 'group' will be initialized after field 'stepSize'd x.x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhread:Id562x:.15x:) ,warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup(group), 562| | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O15_:# #warning: ainitializer order does not match the declaration order [-Wreorder-ctor]l go, NCCL_ P562R | O T O _ #t#ipdr(ottiod>)(,) .nrtuhnr(e&andcsc(lnSthhmreema.dwso)r,k )t;i d\I n B| l ^o ck(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d ), nt h563r | e a d s (snttehprSeiazdes()n,c ctliSdhImneBml.occokm(mt.hbruefafdSIidzxe.sx[)N,C CgLr_oPuRpO(TgOr_oSuIpM)P,L E ]| / ^~~~~~~~~~~~~~~~~N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:T562E:P60S:/ snote: ifield 'group' will be initialized after field 'stepSize'z eof(T )562) | { | t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d (| t group(groupi d), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s626(:n9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds), t626i | d I n B l o c k (ptrhirmesa(dtIiddx-.txi)d,S tgarrotuSpc(agtrtoeurp,) ,n T h| r ^~~~~~~~~~~e adsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:s562t:,15 :& dwarning: iinitializer order does not match the declaration order [-Wreorder-ctor]r ect->out, n562u | l l p t rt,i da(rtgisd-)>,s enntdhbruefafd,s (anrtghsr-e>ardesc)v,b utfifd,I n B| l ^o ck(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I202d:x53.:x )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here grou p202( | g r o u p ) , R| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n W o| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k Eleme n563t | < F n , sTt,e pRSeidzOep(,n cAcllgSoh,m ePmr.octoom>m(.)b.urfufnS(iwzee)s;[ N C| C ^L _PROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppI:M5P:L1E:] /note: Nin instantiation of member function 'RunWork, 2, 2>::run' requested hereC CL_S T5E | PISM/PsLi_zCeOoLfL(_TF)U)N C{( A l| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R e d| u group(groupc e, COLLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:_641D:I11R:E Cnote: Tin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, SIMPL E641, | M i n , u i n t 8p_rti)m s (| t^i d-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r391t:R95e:d unote: cexpanded from macro 'IMPL_COLL_FUNC'e , nThre a391d | s R eRduuncWeo,r kdndco#w#nf,u n&cd,i rteycpte-,> oFuutn,c #a#rdgesv-r>esdeonpd ,a rNgCsC-L>_rAeLcGvOb_u#f#fa,l g o| , ^ NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T202O:_53#:# pnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo to>( )202. | r u n ( & n c c lRSuhnmWeomr.kwEolrekm)e;n t\< F n| , ^ T, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:p562,: 15A:l gnote: ofield 'nthreads' will be initialized after field 'tidInBlock', Prot o562> | ( ) . r utni(dw(et)i;d ) ,| ^n thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpps:(5n:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered s), t5i | dIIMnPBLl_oCcOkL(Lt_hFrUeNaCd(IAdlxl.Rxe)d,u cger,o uCpO(LgLrNoEuTp_)D,I R E| C ^~~~~~~~~~~~~~~~~T , SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E60,: Mnote: ifield 'group' will be initialized after field 'stepSize'n , uint 8562_ | t ) | t^i d(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:)391,: 95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads(n t391h | r e aRdusn)W,o rtki, NCCL_ALGO_##algo, NCCL_PROTO_##proto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:/562s:i15z:e owarning: finitializer order does not match the declaration order [-Wreorder-ctor]( T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 | | group(group tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d677):,11 :n tnote: hin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads( n677t | h r e a d s ) , t ipdrIinmBsl(otcikd(-tthirdeSatdaIrdtxB.cxa)s,t ,g rnoTuhpr(egardosuBpc)a,s t ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~& d i| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ct->o u563t | , d i rsetcetp-S>idzoew(nn,c calrSghsm-e>ms.ecnodmbmu.fbfu,f faSrigzse-s>[rNeCcCvLb_uPfRfO,T O _| S ^I MPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202C:L53_:S Tnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP S/si z202e | o f ( T ) ) { R u| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W o r| k group(groupE lement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO p, Al g666o | , P r o t o > (p)r.irmusn((twied),; n T| h ^r eadsG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppa:t5h:e1r:, note: din instantiation of member function 'RunWork, 2, 2>::run' requested herei rect -5> | uIpM,P LN_UCLOLL,L _aFrUgNsC-(>AslelnRdebduufcfe,, aCrOgLsL-N>ErTe_cDvIbRuEfCfT,, S| I ^M PLE, Mi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:,202 :u53i:n tnote: 8in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ t) 202| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391R:u95n:W onote: rexpanded from macro 'IMPL_COLL_FUNC'k Elemen t391< | F n ,R uTn,W oRrekd (t)y.preu,n (Fwuen)c;# # d| e ^v redop:, note: Nin instantiation of member function 'RunWork, 2, 2>::run' requested hereC CL_A L5G | OI_M#P#La_lCgOoL,L _NFCUCNLC_(PARlOlTROe_d#u#cper,o tCoO>L(L)N.ErTu_nD(I&RnEcCcTl,S hSmIeMmP.LwEo,r kM)i;n ,\ u i| n ^t 8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 15: note: field 'nthreads' will be initialized after field 'tidInBlock'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: 562note: | expanded from macro 'IMPL_COLL_FUNC' tid (391t | i d )R,u nnWtohrrkeo,u pN(CgCrLo_uApL)G,O _ #| # ^~~~~~~~~~~~~~~~~a lgo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C60C:L _note: Pfield 'group' will be initialized after field 'stepSize'R OTO_# #562p | r o t o >t(i)d.(rtuind()&,n cnctlhSrhemaedms.(wnotrhkr)e;a d\s ) ,| ^t idIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~d s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~d Idx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 60g:r onote: ufield 'group' will be initialized after field 'stepSize'p (grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), n563t | h r e a ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60563: | note: field 'group' will be initialized after field 'stepSize' step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d15):, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~~~~~~~e adI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x60):, note: gfield 'group' will be initialized after field 'stepSize'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , nth r563e | a d s ( nsttherpeSaidzse)(,n ctcildSIhnmBelmo.ccko(mtmh.rbeuafdfISdixz.exs)[,N CgCrLo_uPpR(OgTrOo_uSpI)M,P L E| ] ^~~~~~~~~~~/ NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.htid):,562 :n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds(nthreads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c k (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h readI d563x | . x ) , sgtreopuSpi(zger(onucpc)l,S h m| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m . c| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m m.bu f563f | S i z e ss[tNeCpCSLi_zPeR(OnTcOc_lSSIhMmPeLmE.]c/oNmCmC.Lb_uSfTfESPiSz/essi[zNeCoCfL(_TP)R)O T{O _ S| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| E group(group] /NCCL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:S677/:s11i:z enote: oin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref (T)) { 677 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group prims(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i687d:S11t:a rnote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereB cast, 687n | T h r e a d s B c a sptr,i m&sd(itriedc-tt-i>doSutta,r tdBicraesctt,- >ndTohwrne,a dasrBgcsa-s>ts,e n&ddbiurfefc,t -a>rogust-,> rneuclvlbputfrf,, a r| g ^s ->sendbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:f202,: 53a:r gnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here- >rec v202b | u f f , | ^ RunWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:e202m:e53n:t , 2, 2>::run' requested heren , T, 202R | e d O p , A l gRou,n WPorroktEol>e(m)e.nrtu, 2, 2>::run' requested here> ().r u5n | (IwMeP)L;_ C O| L ^L _FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppl:l5R:e1d:u cnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested here, COL L5N | EITM_PDLI_RCEOCLTL,_ FSUINMCP(LAEl,l RMeidnu,c eu,i nCtO8L_LtN)E T _| D^I RECT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391S:I95M:P Lnote: Eexpanded from macro 'IMPL_COLL_FUNC', Min, u391i | n t 8R_utn)W o r| k^< nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hF:u391n:c95#:# fnote: uexpanded from macro 'IMPL_COLL_FUNC'n c, typ e391, | F uRnucn#W#odervkr#,f uNnCcC,L _tAyLpGeO,_ #F#uanlcg#o#,d eNvCrCeLd_oPpR#,p rNoCtCoL>_(A)L.GrOu_n#(#&anlcgcol,S hNmCeCmL._wPoRrOkT)O;_ #\# p r| o ^t o>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15&:n cnote: cfield 'nthreads' will be initialized after field 'tidInBlock'l Shmem .562w | o r k ) ;t i\d ( t| i ^d ), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~c k(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60I:d xnote: .field 'group' will be initialized after field 'stepSize'x ), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~, nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60s:( nnote: tfield 'group' will be initialized after field 'stepSize'h read s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~k (threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:60: note: 563field 'group' will be initialized after field 'stepSize' | s t562e | p S i z et(indc(ctliSdh)m,e mn.tchormema.dbsu(fnftShirzeeasd[sN)C,C Lt_iPdRIOnTBOl_oScIkM(PtLhEr]e/aNdCICdLx_.SxT)E,P Sg/rsoiuzpe(ogfr(oTu)p)) ,{ | | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkd,( tNiCdC)L,_ AnLtGhOr_e#a#dasl(gnot,h rNeCaCdLs_)P,R OtTiOd_I#n#Bplrooctko(>t(h)r.eraudnI(d&xn.cxc)l,S hgmreomu.pw(ogrrko)u;p )\, | | ^ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 563note: | field 'nthreads' will be initialized after field 'tidInBlock' st e562p | S i ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s563t | e p S i zset(enpcScilzSeh(mnecmc.lcSohmmme.mb.ucfofmSmi.zbeusf[fNSCiCzLe_sP[RNOCTCOL__SPIRMOPTLOE_]S/INMCPCLLE_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h]S:/TN562EC:PC15SL:/_ Sswarning: Tiinitializer order does not match the declaration order [-Wreorder-ctor]EzP eSo/fs(iTz) e)562o f | {( T ) | ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t {i d | (| group(groupt ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ i d )| , group(group nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r666e:a9d:s (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: n:in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret687 h:r11e:a d666note: s | in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , t i d I 687n | Bp lr oi cm k s( (t th i rd e,pa rdinImTsdh(xrt.iexda)-dt,si dGgSatrtaorhuteBpcra(,sgt ,rd oinurTpeh)cr,te -a d>| suB ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~pc ,a s| Nt tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)U, L L&,d ia563rre | gc ts --> >o susettn,e dpnbSuuliflzfpe,t(r n,a cracrglgsSs--h>>msreeemncd.bvcubofufmf,mf .a,rb gu sf-| f> ^Sr eiczve/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hbsu:[f202Nf:C,53 C: L| _ ^note: P in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR OTO_ S202I | M /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP: L202 :E 53] :/ N note: C in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereCR Lu_nSWToE r202Pk | S E/ ls ei mz ee no tf gnote: o(in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here,) .Prruon t(o677w> | e( )) ;. r u n| ( ^ w e ) ;/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppp :r 6i| :m ^1 s:( tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered -ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppd: S56t: | a1Ir:M note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement:( )warning: .initializer order does not match the declaration order [-Wreorder-ctor]r un(we); 562| | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:i6d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read s6( | nItMhPrLe_aCdOsL)L,_ FtUiNdCI(nABllloRcekd(utcher,e aCdOILdLxN.ExT)_,D IgRrEoCuTp,( gSrIoMuPpL)E,, M| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n , | i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n t32_t )563 | | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i391z:e95(:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Shmem .391c | o m mR.ubnuWfofrSki),) N{C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A L G| O group(group_ ##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C641C:L11_:P Rnote: Oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_##pr o641t | o > ( ) . r u n ( & npcrcilmSsh(mteimd.-wtoirdkS)t;a r\t R e| d ^u ce, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s Rnote: efield 'nthreads' will be initialized after field 'tidInBlock'd uce, d562i | r e c t -t>iddo(wtni,d )&,d inrtehcrte-a>dosu(tn,t harregasd-s>)s,e ntdibduIfnfB,l oacrkg(st-h>rreeacdvIbduxf.fx,) , | g ^r oup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202):,53 : | note: ^~~~~~~~~~~~~~~~~in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 60 : note: field 'group' will be initialized after field 'stepSize' Run W562o | r k E l etmiedn(tti(d)I.nrBulno(cwke()t;h r e| a ^d Idx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :g6r:o1u:p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup) ,6 | I| M ^~~~~~~~~~~P L_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562m:.15w:o rwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]) ; \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dnote: )field 'nthreads' will be initialized after field 'tidInBlock', nthr e562a | d s ( n tthirde/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~( group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d swarning: )initializer order does not match the declaration order [-Wreorder-ctor], tidInBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~, tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 : 15| : ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56253 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid( t202i | d ) , n t h r eRaudnsW(onrtkhErleeamdesn)t,< Ftni,d ITn,B lRoecdkO(pt,h rAelagdoI,d xP.rxo)t,o >g(r)o.urpu(ng(rwoeu)p;) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7: 1563: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here ste p7S | iIzMeP(Ln_cCcOlLSLh_mFeUmN.Cc(oAmlml.RbeudfufcSei,z eCsO[LNLCNCELT__PDRIORTEOC_TS,I MSPILMEP]L/EN,C CMLi_nS,T EuPiSn/ts3i2z_eto)f ( T| )^) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 391 :| 95 group(group: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 655391: | 11 : Rnote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Work< n655c | c l F u n c # # f u npcr,i mtsy(ptei,d -FtuindcS#t#adretvRreedduocpe<,t ynpTeh>r,e aNdCsCRLe_dAuLcGeO,_ #n#ualllgpot,r ,N C&CdLi_rPeRcOtT-O>_o#u#tp,r oatrog>s(-)>.sreunnd(b&unfcfc,l Sahrmgesm-.>wroerckv)b;u f\f , | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202562 | | t i d (RtuindW)o,r knEtlhermeeandts<(Fnnt,h rTe,a dRse)d,O pt,i dAIlngBol,o cPkr(otthor>e(a)d.Irduxn.(xw)e,) ;g r o| u ^p (group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp,: 6 :| 1 ^~~~~~~~~~~~~~~~~: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 606: | Inote: Mfield 'group' will be initialized after field 'stepSize'P L_COLL _562F | U N C ( AtlildR(etdiudc)e,, nCtOhLrLeNaEdTs_(DnItRhErCeTa,d sS)I,M PtLiEd,I nMBilno,c ki(ntth3r2e_atd)I d x| .^x ), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95(:g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p), | ^~~~~~~~~~~391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'nthreads' will be initialized after field 'tidInBlock'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | ti d562( | t i d ) ,t indt(htrieda)d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d ss((nn tt562hh | r re ea ad dsts)i),d, ( ttiitddI)in,dB IlnnotBchlkr(oetcahkdr(set(ahndrtIhderaxed.aIxdd)xs,.) x,g) rto,iu dpI(gngrBrlooucpk)(,t h or| ue ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~pa (d I| gd tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)rx o.uxp )563,) | , g r o | uspt ^~~~~~~~~~~~~~~~~(e gprSoiu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hzp:e)562(,:n 60c c| :l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ S note: hfield 'group' will be initialized after field 'stepSize' m| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m .com m 562.563 | b | u f f tS iisztedes(p[tSNiiCdzC)e,L( _nnPctRchOrlTSOeha_mSedImsM.(PcnLotEm]mh/.rNbeCuCfaLdf_SSsiT)zEeP,sS[/ NstCiiCzdLeI_onPfBR(lOToT)cO)k_ (S{I tM hPr| Le ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ea ]d /I| Nd group(groupCx .CxL)_,S T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hgE:rP666oS:u/p9s:(i gznote: reoin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereou fp()T ,)666 ) | | { ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | p group(groupr ims(tid,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n687T:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres Gather ,687 | d i r e c t - > u p ,p rNiUmLsL(,t iadr-gtsi-d>SsteanrdtbBucfafs,t ,a rngTsh-r>eraedcsvBbcuafsf,t , | & ^d irect-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:o202u:t53,: nnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel lptr ,202 | a r g s - > s e nRdubnuWfofr,k Ealregmse-n>trin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( ).ru n202( | w e ) ; | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppr:k6E:l1e:m enote: nin instantiation of member function 'RunWork, 2, 2>::run' requested heret , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:562,: 15a:r gwarning: s-initializer order does not match the declaration order [-Wreorder-ctor]> recvbuff, | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202(:t53i:d )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here nthr e202a | d s ( n t h r e aRdusn)W,o rtkiEdlIenmBelnotcp()),. r u| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( w e| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp563: | 6 : 1 : snote: tin instantiation of member function 'RunWork, 2, 2>::run' requested heree pS i6z | eI(MnPcLc_lCSOhLmLe_mF.UcNoCm(mA.lbluRfefdSuiczee,s [CNOCLCLLN_EPTR_ODTIOR_ESCITM,P LSEI]M/PNLCEC,L _MSiTnE,P Si/nsti3z2e_otf)( T )| )^ { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 391 :| 95 group(group: note: expanded from macro 'IMPL_COLL_FUNC' 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :R655u:n11W:o rnote: kin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here< ncclF u655n | c # # f u n c , t ypprei,m sF(utnicd#-#tdiedvSrteadrotpR,, nNTChCrLe_aAdLsGROe_d#u#cael,g on,u lNlCpCtLr_,P R&OdTiOr_e#c#tp-r>ootuot>,( )a.rrgusn-(>&snecncdlbSuhfmfe,m .awrogrsk-)>;r e\c v b| u ^ ff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562 :15| : ^ note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered (tid) ,202 | n t h r e a d s (RnutnhWroerakdEsl)e,m etnitdp(()g.rrouunp()w,e ) ;| ^~~~~~~~~~~~~~~~~ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppfield 'group' will be initialized after field 'stepSize': 7:1 :562 | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid (7t | iIdM)P,L _nCtOhLrLe_aFdUsN(Cn(tAhlrleRaeddsu)c,e ,t iCdOILnLBNlEoTc_kD(ItRhErCeTa,d ISdIxM.PxL)E,, gMrionu,p (ugirnotu3p2)_,t ) | ^~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork , NtCiCdL(_tAiLdG)O,_ #n#tahlrgeoa,d sN(CnCtLh_rPeRaOdTsO)_,# #tpirdoItnoB>l(o)c.kr(utnh(r&enacdcIldSxh.mxe)m,. wgorroku)p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::60562:: 15note: :field 'group' will be initialized after field 'stepSize' warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h1::562 :note: 15in instantiation of member function 'RunWork, 2, 2>::run' requested here: warning: initializer order does not match the declaration order [-Wreorder-ctor] 7 | IMPL _562C | O L L _ FtUiNdC((tAildl)R,e dnutcher,e aCdOsL(LnNtEhTr_eDaIdRsE)C,T ,t iSdIIMnPBLlEo,c kM(itnh,r euaidnItd3x2._xt)), g| r^o up(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:)95,: note: | expanded from macro 'IMPL_COLL_FUNC' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563R | u n W o rskt_,S INMCPCLLE_]A/LNGCOC_L#_#SaTlEgPoS,/ sNiCzCeLo_fP(RTO)T)O _{# # p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o t o| > group(group( ).run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:c641l:S11h:m enote: min instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. work) ;641 | \ | ^ pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:m562s:(15t:i dnote: -field 'nthreads' will be initialized after field 'tidInBlock't idStart R562e | d u c e ,t indT(htrieda)d,s Rnetdhurceea,d sd(inrtehcrte-a>ddso)w,n ,t i&ddIinrBelcotc-k>(otuhtr,e aadrIgdsx-.>xs)e,n dgbruofufp,( garrogusp-)>,r e c| v ^~~~~~~~~~~~~~~~~b uff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: 562note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here t i202d | ( t i d ) , n tRhurneWaodrsk(Enltehmreenatd)(,) .grruonu(pw(eg)r;o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655 | prims(tid-tidStartReduce ,7 | nITMhPrLe_aCdOsLRLe_dFuUcNeC,( AnlullRlepdturc,e ,& dCiOrLeLcNtE-T>_oDuItR,E CaTr,g sS-I>MsPeLnEd,b uMfifn,, aurignst-3>2r_etc)v b u| f^f , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hexpanded from macro 'IMPL_COLL_FUNC': 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 391 | R u202n | W o r k < n c c lRFuunnWco#r#kfEulnecm,e nttyt,o >N(C)C.Lr_uAnL(GwOe_)#;# a l| g ^o , NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppP:R8O:T1O:_ #note: #in instantiation of member function 'RunWork, 2, 2>::run' requested herep roto >8( | )I.MrPuLn_(C&OnLcLc_lFSUhNmCe(mA.lwloRrekd)u;c e\, C| O ^L LNET_DIRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :S562I:M15P:L Enote: ,field 'nthreads' will be initialized after field 'tidInBlock' Min, in t5626 | 4 _ t ) t i| d^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's (nthr e391a | d s )R,u ntWiodrIkn:,562 :N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'A LGO_## a562l | g o , NtCiCdL(_tPiRdO)T,O _n#t#hprreoatdos>((n)t.hrruena(d&sn)c,c ltSihdmIenmB.lwoocrkk()t;h r\e a d| I ^d x.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup), 562| | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):.562r:u15n:( wwarning: einitializer order does not match the declaration order [-Wreorder-ctor]) ; | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:i7d:(1t:i dnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, nth r7e | aIdMsP(Ln_tChOrLeLa_dFsU)N,C (tAildlIRneBdluoccek,( tChOrLeLaNdEITd_xD.IxR)E,C Tg,r oSuIpM(PgLrEo,u pM)i,n , | u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i n t| 3 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)2 _t) | 563^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:t391e:p95S:i znote: eexpanded from macro 'IMPL_COLL_FUNC'( ncclSh m391e | m . cRoumnmW.obrukff,( TN)C)C L{_ A L| G ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O _ #| # group(groupa lgo, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_687P:R11O:T Onote: _in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #proto >687( | ) . r u n ( & n c c lpSrhimmesm(.twiodr-kt)i;d S\t a r| t ^B cast,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562T:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's Bcast ,562 | & d i r etcitd-(>toiudt),, nnutlhlrpetard,s (anrtghsr-e>asdesn)d,b utfifd,I naBrlgosc-k>(rtehcrvebaudfIfd,x . x| ) ^, grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : note: field 'group' will be initialized after field 'stepSize'R unWor k562E | l e m e ntti,( )t.irduInn(Bwleo)c;k ( t| h ^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppx:.6x:)1,: gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up(g r6o | uIpM)P,L _ C| O ^~~~~~~~~~~L L_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:r15e:c vwarning: binitializer order does not match the declaration order [-Wreorder-ctor]u ff, | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53t:i dnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret id), 202n | t h r e a d s ( nRtuhnrWeoardksE)l,e mteindtI((g)r.oruupn)(,w e )| ; ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :5638 | : 1 : note: sin instantiation of member function 'RunWork, 2, 2>::run' requested heret epS i8z | eI(MnPcLc_lCSOhLmLe_mF.UcNoCm(mA.lbluRfefdSuiczee,s [CNOCLCLLN_EPTR_ODTIOR_ESCITM,P LSEI]M/PNLCEC,L _MSiTnE,P Si/nsti6z4e_otf)( T )| )^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : group(group95 : note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h391: | 687 : 11R:u nnote: Win instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo rkh,r eNaCdCsLB_cAaLsGtO,_ #&#dailrgeoc,t -N>CoCuLt_,P RnOuTlOl_p#t#rp,r oatrog>s(-)>.sreunnd(b&unfcfc,l Sahrmgesm-.>wroerckv)b;u f\f , | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :field 'nthreads' will be initialized after field 'tidInBlock'53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i d )R,u nnWtohrrkeEaldesm(enntthd(x)..xr)u,n (gwreo)u;p ( g| r ^o up), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp| : ^~~~~~~~~~~~~~~~~7 :1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWork, 2, 2>::run' requested here: 60: note: field 'group' will be initialized after field 'stepSize'7 | IMP L562_ | C O L L _tFiUdN(Ct(iAdl)l,R endtuhcree,a dCsO(LnLtNhErTe_aDdIsR)E,C Tt,i dSIInMBPlLoEc,k (Mtihnr,e audiIndtx3.2x_)t,) g r| o^u p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p391):,95 : | note: ^~~~~~~~~~~expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r15g:s -warning: >initializer order does not match the declaration order [-Wreorder-ctor]s endbuff, 562a | r g s - >triedc(vtbiudf)f,, n t| h ^r eads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, tid I202n | B l o c k ( t h rReuandWIodrxk.Exl)e,m egnrto ( ) . rsutne(pwSei)z;e ( n| c ^c lShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppc:o9m:m1.:b unote: fin instantiation of member function 'RunWork, 2, 2>::run' requested heref Size s9[ | NICMCPLL__PCROOLTLO__FSUINMCP(LAEl]l/RNeCdCuLc_eS,T ECPOSL/LsNiEzTe_oDfI(RTE)C)T ,{ S I| M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P L E| , group(group Min, uin/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:66554:_11t:) note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h655: | 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' pri m391s | ( t iRdu-ntWiodrSktt,- >NoCuCtL,_ AaLrGgOs_-#>#saelngdob,u fNfC,C La_rPgRsO-T>Or_e#c#vpbruoftfo,> ( )| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE,: 562M:i15n:, warning: iinitializer order does not match the declaration order [-Wreorder-ctor]n t64_t) | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95(:t inote: dexpanded from macro 'IMPL_COLL_FUNC') , nthr e391a | d s (RnutnhWroerakd, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhrea:d562I:d15x:. xwarning: )initializer order does not match the declaration order [-Wreorder-ctor], group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~i d), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: (field 'group' will be initialized after field 'stepSize'n thread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraoudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c k (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h rea d563I | d x . x )s,t egprSoiuzpe((gnrcoculpS)h,m e m| . ^~~~~~~~~~~c omm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hproto:>666(:)9.:r unote: nin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( &ncclShme m666. | w o r k ) ; \ p r| i ^m s(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:T15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd sGath e562r | , d i rteicdt(-t>iudp),, NnUtLhLr,e aadrsg(sn-t>hsreenaddbsu)f,f ,t iadrIgnsB-l>orcekc(vtbhurfefa,d I d| x ^. x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 60 : note: field 'group' will be initialized after field 'stepSize' Run W562o | r k E l etmiedn(tti(d)I.nrBulno(cwke()t;h r e| a ^d Idx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppg:r8o:u1p:( gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up), 8 | | I ^~~~~~~~~~~M PL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tiSTEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here562 :15: warning: 202initializer order does not match the declaration order [-Wreorder-ctor] | 562R | u n W o rtkiEdl(etmiedn)t,< Fnnt,h rTe,a dRse(dnOtph,r eAaldgso),, PtriodtIon>B(l)o.crku(nt(hwree)a;d I d| x ^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppo:u9p:(1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ), | 9 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | I M| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _COL L563_ | F U N C (sAtlelpRSeidzuec(en,c cClOSLhLmNeEmT._cDoImRmE.CbTu,f fSSIiMzPeLsE[,N CMCiLn_,P RuOiTnOt_6S4I_MtP)L E ]| /^N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:T391E:P95S:/ snote: iexpanded from macro 'IMPL_COLL_FUNC'z eof(T )391) | { R u| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W o r| k group(group< ncclFunc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h#:#666f:u9n:c ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret ype, 666F | u n c # # d e v rperdiomps<(ttyipde,> ,n TNhCrCeLa_dAsLGGaOt_h#e#ra,l gdoi,r eNcCtC-L>_uPpR,O TNOU_L#L#,p raortgos>-(>)s.ernudnb(u&fnfc,c laSrhgmse-m>.rweocrvkb)u;f f\, | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202562 | | t i d (RtuindW)o,r knEtlhermeeandts<(Fnnt,h rTe,a dRse)d,O pt,i dAIlngBol,o cPkr(otthor>e(a)d.Irduxn.(xw)e,) ;g r o| u ^p (grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppp:)8,: 1 :| ^~~~~~~~~~~~~~~~~note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :8562 | :I60M:P Lnote: _field 'group' will be initialized after field 'stepSize'C OLL_FU N562C | ( A l l Retdiudc(et,i dC)O,L LnNtEhTr_eDaIdRsE(CnTt,h rSeIaMdPsL)E,, tMiidnI,n Bilnotc6k4_(tt)h r e| a^d Idx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:x391):,95 :g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p(gr o391u | p ) ,R u n| W ^~~~~~~~~~~o rk, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>()COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>().ru:n562(:&15n:c cwarning: linitializer order does not match the declaration order [-Wreorder-ctor]S hmem.work); \ 562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), g r563o | u p ( g rsotuepp)S,i z e| ( ^~~~~~~~~~~~~~~~~n cclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m60.:c onote: mfield 'group' will be initialized after field 'stepSize'm .buffS i562z | e s [ N CtCiLd_(PtRiOdT)O,_ SnItMhPrLeEa]d/sN(CnCtLh_rSeTaEdPsS)/,s itziedoIfn(BTl)o)c k{( t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| I group(groupd x.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p666):,9 : | note: ^~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, CO nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWork, 2, 2>::run' requested here: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor]8 | IMPL_C O562L | L _ F U NtCi(dA(ltliRde)d,u cnet,h rCeOaLdLsN(EnTt_hDrIeRaEdCsT),, StIiMdPILnEB,l oMcikn(,t hirneta6d4I_dtx). x )| ,^ group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:g391r:o95u:p )note: ,expanded from macro 'IMPL_COLL_FUNC' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)391 | R u563n | W o r k M,P LNEC]C/LN_CACLLG_OS_T#E#PaSl/gsoi,z eNoCfC(LT_)P)R O{T O _| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~# p r| o group(groupt o>().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h&:n655c:c11l:S hnote: min instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree m.wor k655) | ; \ | ^ pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:m562s:(15t:i dnote: -field 'nthreads' will be initialized after field 'tidInBlock't idSta r562t | R e d u ctei,d (ntTihdr)e,a dnstRherdeuacdes,( nntuhlrlepatdrs,) ,& dtiirdeIcntB-l>oocukt(,t harregasd-I>dsxe.nxd)b,u fgfr,o uapr(ggsr-o>urpe)c,v b u| f ^~~~~~~~~~~~~~~~~f , | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: 562note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ti d202( | t i d ) , n t hRruenaWdosr(knEtlhermeeandts<)F,n ,t iTd,I nRBeldoOcpk,( tAhlrgeoa,d IPdrxo.txo)>,( )g.rrouunp((wger)o;u p )| , ^ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]R unWorkE l562e | m e n t ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.homm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~t hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h687::56211::15 :note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herewarning: initializer order does not match the declaration order [-Wreorder-ctor] 687 | 562 | pr itmisd((ttiidd-)t,i dnSttharretaBdcsa(sntt,h rneTahdrse)a,d stBicdaIsntB,l o&cdki(rtehcrte-a>doIudtx,. xn)u,l lgprtoru,p (agrrgosu-p>)s,e n d| b ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u f f| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) args- >563r | e c v b usftfe,p S i| z ^e (ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:m202e:m53.:c onote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem .buf f202S | i z e s [ N C C LR_uPnRWOoTrOk_ESlIeMmPeLnEt]| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) . r| u group(groupn (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 9note: :in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 626 | 9 | I M P L _ C OpLrLi_FmUsN(Ct(iAdl-ltRieddSutcaer,t SCcOaLtLtNeErT,_ DnITRhErCeTa,d sSSIcMaPtLtEe,r ,M iNnU, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g roup), | 562 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(gr o563u | p ) , s| t ^~~~~~~~~~~e pSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15: note: field 'nthreads' will be initialized after field 'tidInBlock' :562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~, gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(60g:r onote: ufield 'group' will be initialized after field 'stepSize'p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid (563t | i d ) , snttehprSeiazdes((nnctchlrSehamdesm).,c otmimd.IbnuBflfoScikz(etsh[rNeCaCdLI_dPxR.OxT)O,_ SgIrMoPuLpE(]g/rNoCuCpL)_,S T E| P ^~~~~~~~~~~S /sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n threa d563s | ( n t h rsetaedpsS)i,z et(indcIcnlBSlhomcekm(.tchormema.dbIudfxf.Sxi)z,e sg[rNoCuCpL(_gPrRoOuTpO)_,S I M| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L E ]| / tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N CCL_S T563E | P S / s iszteeopfS(iTz)e)( n{c c l| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h m e| m group(group. comm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:i677z:e11s:[ Nnote: Cin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC L_PROT O677_ | S I M P L E ] / N C CpLr_iSmTsE(PtSi/ds-itziedoSft(aTr)t)B c{a s t| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n T| h group(groupr eadsBcast, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:i641r:e11c:t -note: >in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo ut, di r641e | c t - > d o w n , aprrgism-s>(steindd-btuifdfS,t aarrtgRse-d>urceec,v bnuTfhfr,e a d| s ^R educe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202d:i53r:e cnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here- >dow n202, | & d i r e c t -R>uonuWto,r kaErlgesm-e>nste rAelcgvob,u fPfr,o t o| > ^( ).run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:)202;: 53 :| ^note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :2029 | : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here R u9n | WIoMrPkLE_lCeOmLeLn_tFE(C)T.,r uSnI(MwPeL)E;, M| i ^n , uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp6:410_:t1): note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h10: | 391I:M95P:L _note: Cexpanded from macro 'IMPL_COLL_FUNC'O LL_FUN C391( | A l lRRuendWuocrek,< nCcOcLlLFNuEnTc_#D#IfRuEnCcT,, tSyIpMeP,L EF,u nMci#n#,d ehvarlefd)o p <| t^y pe>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :N391C:C95L:_ Anote: Lexpanded from macro 'IMPL_COLL_FUNC'G O_##alg o391, | N CRCuLn_WPoRrOkT#(f)u.nrun(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562 : 15R:u nwarning: Winitializer order does not match the declaration order [-Wreorder-ctor]o rkEleme n562t | < F n , tTi,d (RteiddO)p,, nAtlhgroe,a dPsr(onttoh>r(e)a.drsu)n,( wtei)d;I n B| l ^o ck(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppI:d10x:.1x:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereg roup( g10r | oIuMpP)L,_ C O| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ F| U tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N C(Al l563R | e d u c es,t eCpOSLiLzNeE(Tn_cDcIlRSEhCmTe,m .ScIoMmPmL.Eb,u fMfiSni,z ehsa[lNfC)C L _| P^R OTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:M391P:L95E:] /note: Nexpanded from macro 'IMPL_COLL_FUNC'C CL_STEP S391/ | s i zReuonfW(oTr)k)< n{c c l| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u n c| # group(group# func, t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hy:p626e:,9 :F unote: nin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec ##dev r626e | d o p < t y p e >p,r iNmCsC(Lt_iAdL-GtOi_d#S#taalrgtoS,c aNtCtCeLr_,P RnOTThOr_e#a#dpsrSoctaot>t(e)r.,r uNnU(L&Ln,c cdliSrhemcetm-.>wuopr,k )a;r g\s - >| s ^e ndbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r15g:s -note: >field 'nthreads' will be initialized after field 'tidInBlock'r ecvbu f562f | , | ^t id(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds(n t202h | r e a d s ) , tRiudnIWnoBrlkoEclke(mtehnrte ^~~~~~~~~~~~~~~~~( ).r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562(:w60e:) ;note: field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:i10d:(1t:i dnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, nth r10e | aIdMsP(Ln_tChOrLeLa_dFsU)N,C (tAildlIRneBdluoccek,( tChOrLeLaNdEITd_xD.IxR)E,C Tg,r oSuIpM(PgLrEo,u pM)i,n , | h ^~~~~~~~~~~a lf) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:)562;: 15\: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56215 | : note: field 'nthreads' will be initialized after field 'tidInBlock' tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), 563| | ^~~~~~~~~~~~~~~~~ s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562p:S60i:z enote: (field 'group' will be initialized after field 'stepSize'n cclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~: 687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hock(th:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), group( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~t id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(group:)562,: 15 :| ^~~~~~~~~~~warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562t:i15d:( twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d ), nthreads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,60 :g rnote: ofield 'group' will be initialized after field 'stepSize'u p(grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), n563t | h r e a dsst(enptShirzeea(dnsc)c,l SthimdeImn.Bcloomcmk.(btuhfrfeSaidzIedsx[.NxC)C,L _gPrRoOuTpO(_gSrIoMuPpL)E,] / N| C ^~~~~~~~~~~C L_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d InBlock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s [ N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_PRO T563O | _ S I M PsLtEe]p/SNiCzCeL(_nScTcElPSSh/mseimz.ecoofm(mT.)b)u f{f S i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e s [| N group(groupC CL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:S655I:M11P:L Enote: ]in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/ NCCL_ S655T | E P S / s i z e o f (pTr)i)m s{( t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- t i| d group(groupS tartRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:e641,: 11n:T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h h:note: r562in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree: a15d:s Rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]641d | u c e , 562 n | u l l p ttrpi,rd i(&mtdsii(drt)ei,c dtn--tt>hiorduetSa,td asar(rtngRtseh-dr>eusacedens,d) b,nu Tfthfir,de IaandrBsglRsoe-cd>kur(cetech,vr beduaifdrfIe,dc xt .-| x> ^)d ,o wgnr,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho :u&202pd(:ig53rr:eo cunote: tpin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here-) >,o u t202| , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ a r | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s - > s eR563nu | dn bW uo fr fks,Etl eeapmrSeginszt-e<>(Frnnec,cc lvTSb,hu mfRefem,d. Oc po| ,m ^ m A.lbguof,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf :SPi202rz:oe53ts:o[ >Nnote: (Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here)C .Lr_uP nR202(O | wT eO )_ ;S I M | P ^L ER]u/n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppNW:Co10Cr:Lk1_E:Sl Tenote: Emin instantiation of member function 'RunWork, 2, 2>::run' requested herePe Sn/t s<10iF | znIe,Mo PfTL(,_T C)RO)Le Ld{_O Fp U,| N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~CA (l Ag| lo group(groupl, R ePdruo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hct:eo677,>: (11C:)O Lnote: .Lin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hererN uEnT(_w De677I) | R; E C T| , ^ S I M P L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp E:p,10r :iMm1is:n( ,tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested herehd a-ltf i)10d S | tI| aM^r PtLB_c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hCa:Os391Lt:L,95_ :Fn UTnote: Nhexpanded from macro 'IMPL_COLL_FUNC'Cr (eAald ls391RB | ec da usRctue,n, W o&CrdOkiLDuoInuRctE#,C# Tfd,ui nrSceI,cM tPt-Ly>Epd,eo ,wM niF,nu ,na crhg#as#l-df>e)vs re en| dd^bo pua95,r: g sNnote: -Cexpanded from macro 'IMPL_COLL_FUNC'>C rLe_cAv Lb391Gu | Of _f #,R #u anl| Wg ^oo r,k , 2, 2>::run' requested hereO# _f#u# np202cr | ,o t to y> p( e) ,. r FuRunun(nc&Wn#oc#rcdkleESvlhrememedemon.ptw<T ,,\ NR Ce| Cd ^LO _pA,L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h G:AO562l:g_15o#:,# anote: Plfield 'nthreads' will be initialized after field 'tidInBlock'rg oot,o >N562(C | )C .L r_ uP nRt(OiwTdeO()_t;#i #d p)| r, ^o tnot>h(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppr):e.9ar:du1sn:(( n¬e: tnin instantiation of member function 'RunWork, 2, 2>::run' requested herehc rcel aS9dh | smI)eM,mP .Ltw_ioCdrOIkLn)LB;_lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I15d:x .warning: xinitializer order does not match the declaration order [-Wreorder-ctor]) , group(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~( ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:)562,: 60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I d| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). x), g r563o | u p ( g rsotuepp)S,i z e| ( ^~~~~~~~~~~n cclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hadIdx.x),: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g roup), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g roup(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p S i| z tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e (ncc l563S | h m e m .sctoempmS.ibzuef(fnSciczleSsh[mNeCmC.Lc_oPmRmO.TbOu_fSfSIiMzPeLsE[]N/CNCCLC_LP_RSOTTEOP_SS/IsMiPzLeEo]f/(NTCCL_)ST)E P{S / s| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e o| f group(group( T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : group(group677 :11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 677note: | in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | p r i m s ( t i dp-rtiimdsS(ttairdt-BtciadsStt,a rntTRherdeuacdes,B cnaTshtr,e a&ddsiRreedcutc-e>,o udti,r edcitr-e>cdto-w>nd,o w&nd,i raercgts-->>osuetn,d baurfgfs,- >asregnsd-b>urfefc,v baurfgfs,- > r| e ^c vbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: 202note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R u n W o r k ERluenmWeonrtk (P)r.ortuon>((w)e.)r;u n (| w ^e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppin instantiation of member function 'RunWork, 2, 2>::run' requested here: 10:1: 10note: | in instantiation of member function 'RunWork, 2, 2>::run' requested hereI MPL_ C10O | LILM_PFLU_NCCO(LALl_lFRUeNdCu(cAel,l RCeOdLuLcNeE,T _CDOILRLENCETT,_ DSIIRMEPCLTE,, SMIiMnP,L Eh,a lMfi)n , | h^a lf) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391^: 95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: 391expanded from macro 'IMPL_COLL_FUNC' | RunW o391r | k < nRcucnlWFournkc<#n#cfculnFcu,n ct#y#pfeu,n cF,u ntcy#p#ed,e vFruendco#p#d,o pNL,G ON_C#C#La_lAgLoG,O _N#C#CaLl_gPoR,O TNOC_C#L#_pPrRoOtToO>_(##)p.rroutno(>&(n)c.crluSnh(m&enmc.cwloSrhkm)e;m .\w o r| k ^) ; \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::60562:: 60note: :field 'group' will be initialized after field 'stepSize' note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | roto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 677 | 562 | p r i mtsi(dt(itdi-dt)i,d SnttahrrteBacdass(tn,t hnrTehardesa)d,s BtciadsItn,B l&odcikr(etchtr-e>aoduItd,x .dxi)r,e cgtr-o>udpo(wgnr,o uapr)g,s - >| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e n d| b tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u ff, ar g563s | - > r e csvtbeupfSfi,z e (| n ^c clShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:o202m:m53.:b unote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref Size s202[ | N C C L _ P R O TROu_nSWIoMrPkLEEl]e/mNeCnCtL<_FSnT,E PTS,/ sRiezdeOopf,( TA)l)g o{, P| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o t o| > group(group( ).run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h;: 626 :| 9 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11: 1626: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | pIrMiPmLs_(CtOiLdL-_tFiUdNSCt(aArltlSRceadtutceer,, CnOTLhLrNeEaTd_sDSIcRaEtCtTe,r ,S INMUPLL, direct->up, args-L>Es,e nMdibnu,f ff,l oaartg)s - >| r^e cvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f391f:,95 : | note: ^expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391202 | : 53 :R unote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereW ork< n202c | c l F u n c # # fRuunncW,o rtkyEplee,m eFnutng,o ,N CPCrLo_tAoL>G(O)_.#r#uanl(gwoe,) ;N C C| L ^_ PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp#:p11r:o1t:o >note: (in instantiation of member function 'RunWork, 2, 2>::run' requested here) .run (11& | nIcMcPlLS_hCmOeLmL._wFoUrNkC)(;A l\l R e| d ^u ce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562N:E15T:_ Dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'R ECT, S562I | M P L E ,t iMdi(nt,i df)l,o antt)h r e| a^d s(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95s:) ,note: expanded from macro 'IMPL_COLL_FUNC't idInBl o391c | k ( tRhurneWaodrIkd , N CtCiLd_(AtLiGdO)_,# #natlhgroe,a dNsC(CnLt_hPrReOaTdOs_)#,# ptriodtIon>B(l)o.crku(nt(h&rnecacdlISdhxm.exm).,w ogrrko)u;p (\g r ou| p ^) , | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::1595:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]expanded from macro 'IMPL_COLL_FUNC' 391 | 562 | R u n W otrikd<(ntcicdl)F,u nnct#h#rfeuandcs,( nttyhpree,a dFsu)n,c #t#iddeIvnrBeldoocpk<(ttyhpree>a,d INdCxC.Lx_)A,L GgOr_o#u#pa(lggroo,u pN)C,C L _| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R O T| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##pro t563o | > ( ) . rsutne(p&Sniczcel(SnhcmcelmS.hwmoermk.)c;o m\m . b| u ^f fSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:[562N:C15C:L _note: Pfield 'nthreads' will be initialized after field 'tidInBlock'R OTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r626e:a9d:I dnote: xin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. x), g r626o | u p ( g r o u p )p,r i m| s ^~~~~~~~~~~~~~~~~( tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:S60t:a rnote: tfield 'group' will be initialized after field 'stepSize'S catte r562, | n T h rteiadd(stSicda)t,t enrt,h rNeUaLdLs,( ndtihrreecatd-s>)u,p ,t iadrIgnsB-l>oscekn(dtbhurfefa,d Iadrxg.sx-)>,r egcrvobuupf(fg,r o u| p ^) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement ( ) . rtuind((wtei)d;) , | n ^t hreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppn:t11h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here) , ti d11I | nIBMlPoLc_kC(OtLhLr_eFaUdNICd(xA.lxl)R,e dgurcoeu,p (CgOrLoLuNpE)T,_ D I| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E C | T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), SIMP L563E | , M i ns,t efplSoiazte)( n c| c^l Shmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:c391o:m95m:. bnote: uexpanded from macro 'IMPL_COLL_FUNC'f fSize s391[ | N C CRL_unWoPrRkO , NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:G641O:_11#:# anote: lin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg o, NC C641L | _ P R O T O _ # # p rportiom>s(()t.irdu-nt(i&dnSctcalrSthRmeedmu.cweo,r kn)T;h r\e a d| s ^R educe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562d:i15r:e cnote: tfield 'nthreads' will be initialized after field 'tidInBlock'- >down ,562 | & d i r etcitd-(>toiudt),, anrtghsr-e>asdesn(dnbtuhfrfe,a dasr)g,s -t>irdeIcnvBbluofcfk,( t h| r ^e adIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:)202,: 53g:r onote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herep (gro u202p | ) , | ^~~~~~~~~~~~~~~~~ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:W562o:r60k:E lnote: efield 'group' will be initialized after field 'stepSize'm ent((n)t.hrruena(dwse)),; t i| d ^I nBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp(:t12h:r1e:a dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hered x.x) ,12 | gIrMoPuLp_(CgOrLoLu_pF)U,N C (| A ^~~~~~~~~~~l lReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct-threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 11warning: :initializer order does not match the declaration order [-Wreorder-ctor]1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11562 | | I M P L _tCiOdL(Lt_iFdU)N,C (nAtlhlrReeaddusc(en,t hCrOeLaLdNsE)T,_ DtIiRdEICnTB,l oScIkM(PtLhEr,e aMdiInd,x .fxl)o,a tg)r o u| p^( grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:)391,: 95 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: expanded from macro 'IMPL_COLL_FUNC' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563 | R u n W osrtkeS,I MNPCLCEL]_/ANLCGCOL__#S#TaElPgSo/,s iNzCeCoLf_(PTR)O)T O{_ # #| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o t| o group(group> ().run(&ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:e687m:.11w:o rnote: kin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) ; \ | 687 ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :p15r:i mnote: sfield 'nthreads' will be initialized after field 'tidInBlock'( tid-t i562d | S t a r ttBicda(stti,d )n,T hnrtehardesaBdcsa(sntt,h r&edaidrse)c,t -t>ioduItn,B lnouclkl(ptthrr,e aadrIgdsx-.>xs)e,n dgbruofufp,( garrogusp-)>,r e c| ^~~~~~~~~~~~~~~~~v buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 : 60| : ^ note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered (tid )202, | n t h r e a d sR(unntWhorrekaEdlse)m,e nttiu(p )(562.g | rr uon u( p w)t,ei )d ;(| t ^~~~~~~~~~~ i | d ^) , nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppd:s12(:n1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea ds) ,12 | tIiMdPILn_BClOoLcLk_(FtUhNrCe(aAdlIldRxe.dxu)c,e ,g rCoOuLpL(NgErTo_up), D| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R E C| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), SIM P563L | E , M isnt,e pdSoiuzbel(en)c c l| S^h mem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:m391m:.95b:u fnote: fexpanded from macro 'IMPL_COLL_FUNC'S izes[ N391C | C L _RPuRnOWToOr_kS, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:A687L:G11O:_ #note: #in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea lgo, N C687C | L _ P R O T O _ # # pprroitmos>((t)i.dr-utni(d&SntcacrltSBhcmaesmt.,w onrTkh)r;e a\d s B| c ^a st, &dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562t:-15>:o unote: tfield 'nthreads' will be initialized after field 'tidInBlock', nullp t562r | , a r gtsi-d>(steindd)bu,f fn,t harregasd-s>(rnetchvrbeuafdfs,) , | t ^i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c202k:(53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hCL_AL:G562O:_15#:# awarning: linitializer order does not match the declaration order [-Wreorder-ctor]g o, NCCL_PROTO_# #562p | r o t o >t(i)d.(rtuind()&,n cnctlhSrhemaedms.(wnotrhkr)e;a d\s ) ,| ^t idInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k15(:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e adId x562. | x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s (| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread s563) | , t i dsItneBplSoiczke((tnhcrcelaSdhImdexm..xc)o,m mg.rbouufpf(Sgirzoeusp[)N,C C L| _ ^~~~~~~~~~~~~~~~~P ROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I60M:P Lnote: Efield 'group' will be initialized after field 'stepSize'] /NCCL _562S | T E P S /tsiidz(etoifd()T,) )n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l641o:c11k:( tnote: hin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadIdx .641x | ) , g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~- tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_ # #| a ^~~~~~~~~~~l go, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ect->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : warning: sinitializer order does not match the declaration order [-Wreorder-ctor]t epSize(n c562c | l S h m etmi.dc(otmimd.)b,u fnftShirzeeasd[sN(CnCtLh_rPeRaOdTsO)_,S ItMiPdLIEn]B/lNoCcCkL(_tShTrEePaSd/Isdixz.exo)f,( Tg)r)o u{p ( g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ) group(group, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 666 :s9t:e pnote: Sin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei ze(ncc l666S | h m e m . c o m mp.rbiumfsf(Stiizde,s [nNTChCrLe_aPdRsOGTaOt_hSeIrM,P LdEi]r/eNcCtC-L>_uSpT,E PNSU/LsLi,z eaorfg(sT-)>)s e{n d b| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f f ,| group(groupa rgs->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hb:u687f:f11,: note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^ 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here prim s202( | t i d - t i d S tRaurntWBocraksEtl,e mneTnhtr oPurto,t on>u(l)l.prturn,( waer)g;s - >| s ^e ndbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp :a12r:g1s:- >note: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree cvbuf f12, | I M| P ^L _COLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hU:N202C:(53A:l lnote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree duce ,202 | C O L L N E T _ DRIuRnEWCoTr,k ESlIeMmPeLnEt,< FMni,n ,T ,d oRuebdlOep), A| l^g o, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391t:o95>:( )note: .expanded from macro 'IMPL_COLL_FUNC'r un(we); 391 | | ^ RunWork, 2, 2>::run' requested here# #func ,12 | tIyMpPeL,_ CFOuLnLc_#F#UdNeCv(rAeldloRpe ,C ONLCLCNLE_TA_LDGIOR_E#C#Ta,l gSoI,M PNLCEC,L _MPiRnO,T Od_o#u#bplreo)t o >| (^) .run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:n391c:c95l:S hnote: mexpanded from macro 'IMPL_COLL_FUNC'e m.work) ;391 | \ R| u ^n Workt,h rNeCaCdLs_)A,L GtOi_d#I#naBllgooc,k (NtChCrLe_aPdRIOdTxO._x#)#,p rgortoou>p(()g.rrouunp()&,n c c| l ^~~~~~~~~~~~~~~~~S hme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562w:o60r:k )note: ;field 'group' will be initialized after field 'stepSize' \ | ^562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )note: ,field 'nthreads' will be initialized after field 'tidInBlock' nthrea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~x .x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95::562 :note: 15expanded from macro 'IMPL_COLL_FUNC': warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | RunWork(,t hNrCeCaLd_IAdLxG.Ox_)#,# aglrgoou,p (NgCrCoLu_pP)R,O T O| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~# # p| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o to>( )563. | r u n ( &sntcecplSSihzmee(mn.cwcolrSkh)m;e m\. c o| m ^m .buffSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:s562[:N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'P ROTO_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:d626I:d9x:. xnote: )in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, grou p626( | g r o u p ) , p| r ^~~~~~~~~~~~~~~~~i ms(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:-562t:i60d:S tnote: afield 'group' will be initialized after field 'stepSize'r tScatt e562r | , n T htrieda(dtsiSdc)a,t tnetrh,r eNaUdLsL(,n tdhirreeacdts-)>,u pt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 391warning: :initializer order does not match the declaration order [-Wreorder-ctor]95 : note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _PRO T563O | _ # # p rsotteop>S(i)z.er(unnc(c&lnSchcmleSmh.mceomm.mw.obrukf)f;S i\z e s| [ ^N CCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_15S:I Mnote: Pfield 'nthreads' will be initialized after field 'tidInBlock'L E]/NC C562L | _ S T E PtSi/ds(itziedo)f,( Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n t| h group(groupr eads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d626I:n9B:l onote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herek (thre a626d | I d x . x ) , gprroiumps((gtriodu-pt)i,d S t| a ^~~~~~~~~~~~~~~~~r tSc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:t562t:e60r:, note: nfield 'group' will be initialized after field 'stepSize'T hrea d562s | S c a t tteird,( tNiUdL)L,, ndtihrreecatd-s>(unpt,h raeragdss-)>,s etniddbIunfBfl,o cakr(gtsh-r>eraedcIvdbxu.fxf),, g| r ^o up(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202):,53 : | note: ^~~~~~~~~~~in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | : group(group562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562666 | | t i d ( tpirdi)m,s (nttihdr,e andTsh(rnetahdrseGaadtsh)e,r ,t iddiIrneBclto-c>ku(pt,h rNeUaLdLI,d xa.rxg)s,- >gsreonudpb(ugfrfo,u pa)r,g s -| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e c| v tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)b uff, 563| | ^ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:S202i:z53e:( nnote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec lShm e202m | . c o m m . b u fRfuSniWzoersk[ENlCeCmLe_nPtRe(o)f.(rTu)n)( w{e ) ;| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677 :1311 | :I Mnote: Pin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL _COLL_ F677U | N C ( A l l R e d u cper,i mCsO(LtLiNdE-Tt_iDdISRtEaCrTt,B cSaIsMtP,L En,T hMriena,d srBcccals_tb,f l&odaitr1e6c)t - >| o^u t, d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:r391e:c95t:- >note: dexpanded from macro 'IMPL_COLL_FUNC'o wn, ar g391s | - > sReunndWbourfkf<,n cacrlgFsu-n>cr#e#cfvubnucf,f ,t y p| e ^, Func##de/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hv:r202e:d53o:p , 2, 2>::run' requested herey pe>, N202C | C L _ A L G O _ #R#uanlWgoor,k ENlCeCmLe_nPtRp(,) .Arlugno(,& nPcrcoltSoh>m(e)m..rwuonr(kw)e;) ;\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 11note: :field 'nthreads' will be initialized after field 'tidInBlock'1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 11t | iIdM(PtLi_dC)O,L Ln_tFhUrNeCa(dAsl(lnRtehdruecaed,s )C,O LtLiNdEITn_BDlIoRcEkC(Tt,h rSeIaMdPILdEx,. xM)i,n ,g rfoluopa(tg)r o u| p^) , | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h95::562 :note: 60expanded from macro 'IMPL_COLL_FUNC': note: field 'group' will be initialized after field 'stepSize' 391 | 562 | R u n Wtoirdk(I,d xN.CxC)L,_ AgLrGoOu_p#(#garloguop,) ,N C C| L ^~~~~~~~~~~_ PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::677 :warning: 11initializer order does not match the declaration order [-Wreorder-ctor]: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562677 | | t i d ( t i dp)r,i mnst(htrieda-dtsi(dnSttharretaBdcsa)s,t ,t indTIhnrBelaodcskB(ctahsrte,a d&Iddixr.exc)t,- >goruotu,p (dgirroeucpt)-,> d o| w ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n , | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r gs->s e563n | d b u f fs,t eaprSgisz-e>(rnecccvlbSuhfmfe,m . c| o ^m m.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z202e:s53[:N Cnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL _PRO T202O | _ S I M P L E ] /RNuCnCWLo_rSkTEElPeSm/esnitz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:)641.:r11u:n (note: win instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ); | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp : 13 : 1 : pnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested herei ms(t i13d | -ItMiPdLS_tCaOrLtLR_eFdUuNcCe(,A lnlTRherdeuacdes,R eCdOuLcLeN,E Td_iDrIeRcEtC-T>,d oSwInM,P L&Ed,i rMeicnt,- >rocuctl,_ bafrlgosa-t>1s6e)n d b| u^f f, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g391s:-95>:r enote: cexpanded from macro 'IMPL_COLL_FUNC'v buff, 391| | ^ RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:<202n:c53c:l Fnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren c##f u202n | c , t y p e , RFuunnWco#r#kdEelvermeednotp<,, RNeCdCOL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp_:,A562 L:AG15lO:g_ o#warning: ,#initializer order does not match the declaration order [-Wreorder-ctor] a Plrgoot, o 562>NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::15 :warning: initializer order does not match the declaration order [-Wreorder-ctor]warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t e p Ssitzeep(Sniczcel(SnhcmcelmS.hcmoemmm..cboumfmf.SbiuzfefsS[iNzCeCsL[_NPCRCOLT_OP_RSOITMOP_LSEI]M/PNLCEC]L/_NSCTCELP_SS/TsEiPzSe/osfi(zTe)o)f ({T ) )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~{ | | group(group ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h655::68711::11 :note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | 687 | p r ipmrsi(mtsi(dt-itdi-dtSitdaSrttaRretdBuccaes,t ,n TnhTrheraedasdRseBdcuacset,, n&udlilrpetcrt,- >&oduitr,e cntu-l>loputtr,, aarrggss-->>sseennddbbuuffff,, aarrggss-->>rreeccvvbbuuffff,, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o up(group), 562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:iz562e:o15f:( Twarning: )initializer order does not match the declaration order [-Wreorder-ctor]) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d626(:t9i:d )note: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nthrea d626s | ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteSacdaItdtxe.rx,) ,n Tghrroeuapd(sgSrcoautpt)e,r , | N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~U L L| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) direc t563- | > u p , satregpsS-i>zsee(nndcbculfSfh,m eamr.gcso-m>mr.ebcuvfbfuSfifz,e s [| N ^C CL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:O202_:S53I:M Pnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE ]/NC C202L | _ S T E P S / s iRzuenoWfo(rTk)E)l e{m e n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~< F n| , group(group T, RedOp, Alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:,666 :P9r:o tnote: oin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here> ().run (666w | e ) ; | ^ prims(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppt:i13d:,1 :n Tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eads G13a | tIhMePrL,_ CdOiLrLe_cFtU-N>Cu(pA,l lNRUeLdLu,c ea,r gCsO-L>LsNeEnTd_bDuIfRfE,C Ta,r gSsI-M>PrLeEc,v bMuifnf,, r c| c ^l _bfloat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h1:6202): 53 :| ^note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202391 | : 95 : note: expanded from macro 'IMPL_COLL_FUNC' RunWo r391k | E l eRmuennWtou(n)c.#r#udne(vwree)d;o p <| t ^y pe>, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppN:C13C:L1_:A Lnote: Gin instantiation of member function 'RunWork, 2, 2>::run' requested hereO _##a l13g | oI,M PNLC_CCLO_LPLR_OFTUON_C#(#AplrloRteod>u(c)e.,r uCnO(L&LnNcEcTl_SDhImReEmC.Tw,o rSkI)M;P L\E , | M ^i n, rc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:l562_:b15f:l onote: afield 'nthreads' will be initialized after field 'tidInBlock't 16) | 562^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthread s391( | n t hRruenaWdosr)k,< ntcicdlIFnuBnlco#c#kf(utnhcr,e atdyIpdex,. xF)u,n cg#r#oduepv(rgerdooupp<)t,y p e| > ^~~~~~~~~~~~~~~~~, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60A:L Gnote: Ofield 'group' will be initialized after field 'stepSize'_ ##al g562o | , N C CtLi_dP(RtOiTdO)_,# #nptrhorteoa>d(s)(.nrtuhnr(e&andcsc)l,S htmiedmI.nwBolrokc)k;( t\h r e| a ^d Idx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock'( group )562, | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]687 | 562 | p r i m st(itdi(dt-itdi)d,S tnatrhtrBecaadsst(,n tnhTrheraedasd)s,B ctaisdtI,n B&ldoicrke(ctth-r>eoaudtI,d xn.uxl)l,p tgrr,o uapr(ggsr-o>uspe)n,d b u| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f , | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r gs->r e563c | v b u f fs,t e p| S ^i ze(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:S202h:m53e:m .note: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo mm.b u202f | f S i z e s [ N CRCuLn_WPoRrOkTEOl_eSmIeMnPtL ( )| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r u n| ( group(groupw e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 13note: :in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 641 | 13 | I M P L _ C O LpLr_iFmUsN(Ct(iAdl-ltRieddSutcaer,t RCeOdLuLcNeE,T _nDTIhRrEeCaTd,s RSeIdMuPcLeE,, dMiirne,c tr-c>cdlo_wbnf,l o&adti1r6e)c t -| >^o ut, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r391g:s95-:> snote: eexpanded from macro 'IMPL_COLL_FUNC'n dbuff, 391a | r g sR-u>nrWeocrvkb, 2, 2>::run' requested hereF unc# #202d | e v r e d o p < tRyupneW>o,r kNEClCeLm_eAnLtGt(o)>.(r)u.nr(uwne()&;n c c| l ^S hmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppw:o12r:k1):; note: \in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^ 12 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_15C:O Lnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ FUNC( A562l | l R e d utcied,( tCiOdL)L,N EnTt_hDrIeRaEdCsT(,n tShIrMePaLdEs,) ,M itni,d IdnoBulbolcek)( t h| r^e adI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391.:x95):, note: gexpanded from macro 'IMPL_COLL_FUNC'r oup(gr o391u | p ) ,R u n| W ^~~~~~~~~~~~~~~~~o rk/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h<:n562c:c60l:F unote: nfield 'group' will be initialized after field 'stepSize'c ##fun c562, | t y p et,i dF(utnicd#)#,d envtrherdeoapdr,e aNdCsC)L,_ AtLiGdOI_n#B#laolcgko(,t hNrCeCaLd_IPdRxO.TxO)_,# #gprrooutpo(>g(r)o.urpu)n,( & n| c ^~~~~~~~~~~c lShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:(562):.r15u:n (warning: &initializer order does not match the declaration order [-Wreorder-ctor]n cclShmem.w o562r | k ) ; \t i d| ( ^t id), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: (field 'nthreads' will be initialized after field 'tidInBlock'n thre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a dIdx.x) ,563 | g r o u ps(tgerpoSuipz)e,( n c| c ^~~~~~~~~~~~~~~~~l Sh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:e562m:.60c:o mnote: mfield 'group' will be initialized after field 'stepSize'. buff S562i | z e s [ NtCiCdL(_tPiRdO)T,O _nStIhMrPeLaEd]s/(NnCtChLr_eSaTdEsP)S,/ tsiidzIenoBfl(oTc)k)( t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I d| x group(group. x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p655):,11 : | note: ^~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cppf:l1a: gIn file included from 1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h,: 10d: aIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ha:2168,: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hf:l153a:g142:; warning: unused variable 'data1' [-Wunused-variable]| ^~~~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlicIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp*:w1a: rIn file included from p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :+10 : 2In file included from */usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hw:i168d: ;/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153| : ^14 : warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o15u:p )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkh,r eNaCdCsL(_nAtLhGrOe_a#d#sa)l,g ot,i dNICnCBLl_oPcRkO(TtOh_r#e#apdrIodtxo.>x()),. rgurno(u&pn(cgcrloSuhpm)e,m . w| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r k )| ; tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) \ | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p15S:i znote: efield 'nthreads' will be initialized after field 'tidInBlock'( ncclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h916::5627::60 :note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: field 'group' will be initialized after field 'stepSize' 916 | 562 | tpirdi(mtsi(dg)r,o unptThirde,a dgsr(onutphNrtehardesa)d,s ,t i&drIencBvl,o c&ks(etnhdr,e aadrIgdsx-.>xs)e,n dgbruofufp,( garrogusp-)>,r e c| v ^~~~~~~~~~~b uff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h563: | 562 : 15 : swarning: tinitializer order does not match the declaration order [-Wreorder-ctor]e pSize(ncclSh m562e | m . c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 916:7: 563note: | in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here st e916p | S i z e ( n cpcrliSmhsm(egmr.ocuopmTmi.db,u fgfrSoiuzpeNst[hNrCeCaLd_sP,R O&TrOe_cSvI,M P&LsEe]n/dN,C CaLr_gSsT-E>PsSe/nsdibzuefoff,( Ta)r)g s{- > r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c v b| u group(groupf f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h7::202 :note: 53in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 916 | 202 | p r i m s (RgurnoWuoprTkiEdl,e mgernotur(g)s.-r>usne(nwdeb)u;f f ,| ^a rgs->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cppe:c8v:b1u:f fnote: ,in instantiation of member function 'RunWork, 3, 2>::run' requested here | ^ 8 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P202L:_53C:O Lnote: Lin instantiation of member function 'RunWorkElement, 3, 2>::run' requested here_ FUN C202( | A l l R e d u c eR,u nCWOoLrLkNEElTe_mCeHnAtI| (^) .run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hw:e391):;95 : | note: ^expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp :39110 | : 1 :R unote: nin instantiation of member function 'RunWork, 3, 2>::run' requested hereW ork <10n | cIcMlPFLu_nCcO#L#Lf_uFnUcN,C (tAylpleR,e dFuuce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recIn file included from v/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cppb:u1f: fIn file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :a10r: gIn file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h-:>167r: e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:A15r:g ,warning: initializer order does not match the declaration order [-Wreorder-ctor]0 , args->co n562n | I n d e xt,i da(rtgisd-)>,c onntnhIrnedaedxs)(;n t h| r ^e ads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :t80i:d5I:n Bnote: lin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereo ck( t80h | r e a d Irduxn.Rxi)n,g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( a r| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s ); | ^563 | ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:S202i:z53e:( nnote: cin instantiation of member function 'RunWorkElement, 1, 2>::run' requested herec lShm e202m | . c o m m . b u fRfuSniWzoersk[ENlCeCmLe_nPtRe(o)f.(rTu)n)( w{e ) ;| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hin instantiation of member function 'RunWork, 1, 2>::run' requested here: 34:7: 5note: | in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI MPL_COL L34_ | F U N C ( R epdruicmes,( tRiIdN,G ,n tShIrMePaLdEs,, P&rreiMnugl-S>upmr,e vu,i n&tr8i_ntg)- > n| e^x t, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g391s:-95>:s enote: nexpanded from macro 'IMPL_COLL_FUNC'd buff, a r391g | s - >RruencWvobrukf#r#efduOnpc, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Arg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hG:O562_:#15#:a lwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]o , NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Twarning: O initializer order does not match the declaration order [-Wreorder-ctor]562_ | # # p r o tt562oi | >d (( ) t. irtdui)nd,(( &tnnitcdhc)rl,Se ahndmteshm(rn.etwaodhrskr()ena;t dhs\r )e ,a | d ^ts )i,d It/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hni:Bd562Il:on15cB:l oknote: (ctfield 'nthreads' will be initialized after field 'tidInBlock'kh r(eta hd562rI | ed ax d. Ix d)tx,i. dxg()rt,oi udgp)r(,go runoptu(hpg)rr,eo aud ps| )( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~,n t | h| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , 563t | i 563d | I n B sl tosectpkeS(pitSzhierz(eneac(cdnlIScdhcxml.eSxmh).m,ce omgm.mrc.oobumupmf(.fgbSruiofzufepSs)i[,zN e Cs| [ ^~~~~~~~~~~~~~~~~CN LC_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hPL:R_562OP:R60TO:OT _Onote: _field 'group' will be initialized after field 'stepSize'SS IIMM PP562LLE | ]E /] N/ CN CtCCLi_LSd_T(SEtPTiSEd/P)sS,i/ zsenoitfzh(erToe)fa)(d Ts{) () n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| d group(groups )| , group(group tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ho:c34k/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:(7:t34:h :r7note: e:in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea note: din instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI d34x | . x ) , g r34po | ru i pm (sg ( rt oipudrp,i) m,ns (tt hi| rd ^~~~~~~~~~~e, a dnst,h r&eraidnsg,- >&prrienvg,- >&prrienvg,- >&nreixntg,- >anregxst-,> saerngdsb-u>fsfe,n dabrugfsf-,> raercgvsb-u>frfe,c vabrugfsf-,> raerdgOsp-A>rrge,d O0p,A ragr,g s0-,> caorngnsI-n>dceoxn,n Ianrdgesx-,> caorngnsI-n>dceoxn)n;I n d| e ^x ); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :80:5: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hnote: :in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here80 :5: 80note: | in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here r80u | n R i n gr (Parrogtso)>;( a r| g ^s ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here202 :53: note: 202in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | 202 | R u n W o r k ERluenmWeonrtk,( )P.rroutno(>w(e)).;r u n| ( ^w e); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp :8:1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cppin instantiation of member function 'RunWork, 1, 2>::run' requested here: 7:1: 8note: | in instantiation of member function 'RunWork, 1, 2>::run' requested hereI MPL_ C7O | LILM_PFL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a dIdx.x), group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) nthr e563a | d s ( n tshtreepaSdisz)e, (tnicdcIlnSBhlmoecmk.(ctohmrme.abduIfdfxS.ixz)e,s [gNrCoCuLp_(PgRrOoTOu_pS)I,M P L| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~] / N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_STEP S563/ | s i z e osft(eTp)S)i z{e ( n| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c l S| h group(groupm em.comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hb:u34f:f7S:i znote: ein instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres [NCCL _34P | R O T O _ S IpMrPiLmEs](/tNiCdC,L _nStThErPeSa/dssi,z e&orfi(nTg)-)> p{r e v| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ & r| i group(groupn g->next, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hg:s34-:>7s:e nnote: din instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereb uff, args -34> | r e c v b u fpfr,i masr(gtsi-d>,r endtOhprAeragd,s ,0 ,& rairnggs-->>pcroenvn,I n&dreixn,g -a>rngesx-t>,c oanrngIsn-d>esxe)n;d b u| f ^f , ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hg:s80-:>5r:e cnote: vin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereb uff ,80 | a r g s -r>urneRdiOnpgAoctoon>n(Ianrdgesx),; a r| g ^s ->co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:n202I:n53d:e xnote: )in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here; | ^202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h : 80 :R5u:n Wnote: oin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer kEl e80m | e n t < Frnu,n RTi,n gRo(>a(r)g.sr)u;n ( w| e ^) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp::538:: 1note: :in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 202 | 8 | I M P L _ C ORLuLn_WFoUrNkCE(lReemdeuncte<,F nR,I NTG,, RSeIdMOPpL,E ,A lPgroe,M uPlroto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffS.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:e562du:c15e:, warning: Rinitializer order does not match the declaration order [-Wreorder-ctor]I NG, SIMPLE, P562r | e M u l Stuimd,( tiindt)6,4 _ntt)h r e| a^d s(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:391r:e95a:d snote: )expanded from macro 'IMPL_COLL_FUNC', tidIn B391l | o c kR(utnhWroerakds,t eNpCSCiLz_eA(LnGcOc_l#S#hamlegmo.,c oNmCmC.Lb_uPfRfOSTiOz_e#s#[pNrCoCtLo_>P(R)O.TrOu_nS(I&MnPcLcEl]S/hNmCeCmL._wSoTrEkP)S;/ s\i z e| o ^f (T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h{: 562 :| 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | field 'nthreads' will be initialized after field 'tidInBlock' group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h : 34t:i7d:( tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nth r34e | a d s ( n t hprreiamdss()t,i dt,i dnItnhBrleoac/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdk:s(562,t: h15&r:re iawarning: ndinitializer order does not match the declaration order [-Wreorder-ctor]gI -d>xp.rx e)562v, | , g &r ro iutnpig(d-g(>rtnoieudxp)t),,, na tr| hg ^~~~~~~~~~~~~~~~~rs e-a>d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:se562(n:nd60tb:hu rfnote: effield 'group' will be initialized after field 'stepSize'a, d sa)r,g s562t- | i> dr Ie nc Bvtlbioudcf(kft(,it dha)rr,eg asnd-tI>hdrrxee.daxOd)p,sA( rnggtr,ho reads), tidInBlock0(,t harregasd-I>dcxo.nxn)I,n dgerxo,u pa(rggrso-u>pc)o,n n I| n ^~~~~~~~~~~d ex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562:60: note: :field 'group' will be initialized after field 'stepSize'562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ WordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hc:o514n:n9I:n dwarning: evariable 'offset' set but not used [-Wunused-but-set-variable]x ); | ^ 514 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :i80n:t5 :o fnote: fin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heres et =80 | t i d ; r u| n ^R ing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t562i | d ( t i dt)i,d (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~~~~~~~p ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 : 60| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: field 'group' will be initialized after field 'stepSize' 563 | 562 | s t e ptSiidz(et(indc)c,l Snhtmherme.acdosm(mn.tbhurfefaSdisz)e,s [tNiCdCILn_BPlRoOcTkO(_tShIrMePaLdEI]d/xN.CxC)L,_ SgTrEoPuSp/(sgirzoeuopf)(,T ) )| ^~~~~~~~~~~{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(In file included from threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cppd:x1.: x)In file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :g10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:r167o: up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o15u:p )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | pruinmWso(rtki#p#rdeevv,r e&droipnpnee>x,t ,N CaCrLg_sA-L>GsOe_n#d#baulfgfo,, aNrCgCsL-_>PrReOcTvOb_u#f#fp,r oatrog>s(-)>.rreudnO(p&Anrcgc,l S0h,m eamr.gwso-r>kc)o;n n\I n d| e ^x , args->conn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562d:e15x:) ;note: field 'nthreads' will be initialized after field 'tidInBlock' | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h562: | 80 : 5 : tnote: iin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hered (ti d80) | , n t hrruenaRdisn(gnl(oacrkg(st)h;r e a| d ^I dx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o202u:p53(:g rnote: oin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereu p), | 202 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : Rnote: ufield 'group' will be initialized after field 'stepSize'n Work E562l | e m e n tt)(,) .triudnI(nwBel)o;c k (| t ^h readIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp):,5 :g1r:o unote: pin instantiation of member function 'RunWork, 1, 2>::run' requested here( grou p5) | ,I M P| L ^~~~~~~~~~~_ COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k15(:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e adIdx.x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads (563n | t h r e asdtse)p,S itzied(InncBclloSchkm(etmh.rceoamdmI.dbxu.fxf)S,i zgerso[uNpC(CgLr_oPuRpO)T,O _ S| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~M P L| E tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)] /NCCL _563S | T E P S /sstiezpeSoifz(eT()n)c c{l S h| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e m .| c group(groupo mm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:z34e:s7[:N Cnote: Cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL _PROTO_ S34I | M P L E ] / NpCrCiLm_sS(TtEiPdS,/ snitzheroefa(dTs),) &{r i n| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- > p| r group(groupe v, &ring->ne/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hx:t34,: 7a:r gnote: sin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >sendb u34f | f , a r g sp-r>irmesc(vtbiudf,f ,n tahrrgesa-d>sr,e d&OrpiAnrgg-,> p0r,e va,r g&sr-i>ncgo-n>nnIenxdte,x ,a ragrsg-s>-s>ecnodnbnuIfnfd,e xa)r;g s -| > ^r ecvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hf:,80 :a5r:g snote: -in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here> red O80p | A r g , r0u,n Rairnggs<-T>,c oRnendIOnpd,e xP,r oatrog>s(-a>rcgosn)n;I n d| e ^x ); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :80: 5202: | note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80R | u n W o rrkuEnlReimnegn(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threaIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement:( )warning: .initializer order does not match the declaration order [-Wreorder-ctor]r un(we); | 562 ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cppt:i4d:(1t:i dnote: )in instantiation of member function 'RunWork, 1, 2>::run' requested here, nt h4r | eIaMdPsL(_nCtOhLrLe_aFdUsN)C,( RteidduIcneB,l oRcIkN(Gt,h rSeIaMdPILdEx,. xM)i,n ,g rionutp8(_gtr)o u p| )^, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 391 :| 95 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: expanded from macro 'IMPL_COLL_FUNC' 563 | 391 | sRtuenpWSoirzke<(nnccccllFSuhnmce#m#.fcuonmcm,. btuyfpfeS,i zFeusn[cN#C#CdLe_vPrReOdToOp_E,] /NNCCCCLL__ASLTGEOP_S#/#sailzgeoo,f (NTC)C)L _{P | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R O T| O group(group_ ##proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h(:)34.:r7u:n (note: &in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren cclShm e34m | . w o r k ) ;p r\i m s| ( ^t id, nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s ,note: field 'nthreads' will be initialized after field 'tidInBlock'& ring- >562p | r e v , t&irdi(ntgi-d>)n,e xntt,h raeragdss-(>nstehnrdebaudfsf),, atrigdsI-n>Brleoccvkb(utfhfr,e aadrIgdsx-.>xr)e,d OgprAorugp,( g0r,o uapr)g,s - >| c ^~~~~~~~~~~~~~~~~o nnI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:d562e:x60,: anote: rfield 'group' will be initialized after field 'stepSize'g s->co n562n | I n d e xt)i;d ( t| i ^d ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hh:r80e:a5d:s (note: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heret hre a80d | s ) , triudnIRniBnlgo (garrogusp)(;g r o| u ^p ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:N562G:,15 :S Iwarning: Minitializer order does not match the declaration order [-Wreorder-ctor]P LE, Min, i562n | t 3 2 _ tt)i d (| t^i d), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: (expanded from macro 'IMPL_COLL_FUNC'n thread s391) | , tRiudnIWnoBrlko, 563N | C C L _ AsLtGeOp_S#i#zael(gnoc,c lNSChCmLe_mP.RcOoTmOm_.#b#upfrfoStioz>e(s)[.NrCuCnL(_&PnRcOcTlOS_hSmIeMPmL.Ew]o/rNkC)C;L _\S T E| P ^S /size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:f562(:T15):) note: {field 'nthreads' will be initialized after field 'tidInBlock' | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 562 group(group | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h(:t34i:d7):, note: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads( n34t | h r e a d s )p,r itmisd(ItniBdl,o cnkt(htrheraedasd,I d&xr.ixn)g,- >gprroeuvp,( g&rroiunpg)-,> n e| x ^~~~~~~~~~~~~~~~~t , a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-60>:s enote: nfield 'group' will be initialized after field 'stepSize'd buff ,562 | a r g s -t>irde(ctvibdu)f,f ,n tahrrgesa-d>sr(endtOhprAeragd,s )0,, tairdgIsn-B>lcoocnkn(Itnhdreexa,d Iadrxg.sx-)>,c ognrnoIunpd(egxr)o;u p )| , ^ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-15>:r ewarning: cinitializer order does not match the declaration order [-Wreorder-ctor]v buff, ar g562s | - > r e dtOipdA(rtgi,d )0,, natrhgrse-a>dcso(nnntIhnrdeeaxd,s )a,r gtsi-d>IcnoBnlnoIcnkd(etxh)r;e a d| I ^d x.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :g80r:o5u:p (note: gin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer oup )80, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ r u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R ing, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:)562.:r15u:n (warning: winitializer order does not match the declaration order [-Wreorder-ctor]e ); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp :t11i:d1(:t inote: din instantiation of member function 'RunWork, 1, 2>::run' requested here) , nt h11r | eIaMdPsL(_nCtOhLrLe_aFdUsN)C,( RteidduIcneB,l oRcIkN(Gt,h rSeIaMdPILdEx,. xM)i,n ,g rfoluopa(tg)r o u| p^) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)95 : note: expanded from macro 'IMPL_COLL_FUNC' 563 | 391 | s t eRpuSniWzoer(kn/,N CNCCLC_LS_TAELPGSO/_s#i#zaelogfo(,T )N)C C{L _ P| R ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O T O| _ group(group# #proto>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h.:r34u:n7(:& nnote: cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec lShmem.w o34r | k ) ; \ p| r ^i ms(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:,562 :n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds, &562r | i n g - >tpirde(vt,i d&)r,i nngt-h>rneeaxdts,( natrhgrse-a>dsse)n,d btuidInfBfl,o cakr(gtsh-r>eraedcIvdbxu.fxf),, agrrgosu-p>(rgerdoOuppA)r,g , | 0 ^~~~~~~~~~~~~~~~~, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:c60o:n nnote: Ifield 'group' will be initialized after field 'stepSize'n dex, a562r | g s - > ctoindn(Itnidde)x,) ;n t h| r ^e ads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hr:e80a:d5s:) ,note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heret idIn B80l | o c k ( trhurneRaidnIgdp()a,r g s| ) ^~~~~~~~~~~; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bwarning: linitializer order does not match the declaration order [-Wreorder-ctor]o ck(thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads )563, | t idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connInrdienxg,- >anregxst-,> caorngnsI-n>dseexn)d;b u f| f ^, args->recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h,: 80a:r5g:s -note: >in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer edOpA r80g | , 0 , raurngRsi-n>gc>c(oanrngIsn)d;e x )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here: 80:5: note: 202in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here | 80 | R u nrWuonrRkiEnlgep(,a rAglsg)o;, P| r ^o to>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(202w:e53):; note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here| ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp : 11 : 1 : Rnote: uin instantiation of member function 'RunWork, 1, 2>::run' requested heren Work E11l | eImMePnLt_M(P)L.Er,u nM(iwne,) ;f l o| a ^t ) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp :9:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391in instantiation of member function 'RunWork, 1, 2>::run' requested here: 95: note: expanded from macro 'IMPL_COLL_FUNC'9 | IMPL_C O391L | L _ FRUuNnCW(oRrekd:,391 :N95C:C Lnote: _expanded from macro 'IMPL_COLL_FUNC'A LGO_##a l391g | o , RNuCnCWLo_rPkRu(n)c.,r utny(p&en,c cFluSnhcm#e#md.ewvorrekd)o;p <\t y p| e ^> , NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O15_:# #note: afield 'nthreads' will be initialized after field 'tidInBlock'l go, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdk); \I n B| l ^o ck(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' group(g r562o | u p) , t| i ^~~~~~~~~~~~~~~~~d (t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,60 :n tnote: hfield 'group' will be initialized after field 'stepSize'r eads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~I dx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g60r:o unote: pfield 'group' will be initialized after field 'stepSize'( group) ,562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c15e:, warning: Rinitializer order does not match the declaration order [-Wreorder-ctor]I NG, SIMPL E562, | M i n ,t ihda(ltfi)d ) ,| ^n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95(:n tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eads), 391t | i d IRnuBnlWoocrkk( | , N C CsLt_eApLSGiOz_e#(#naclcgloS,h mNeCmC.Lc_oPmRmO.TbOu_f#f#Spirzoetso[>N(C)C.Lr_uPnR(O&TnOc_cSlISMhPmLeEm]./wNoCrCkL)_;S T\E P S| / ^s izeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: note: | field 'nthreads' will be initialized after field 'tidInBlock' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:(34t:i7d:) ,note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren threads (34n | t h r e a d sp)r,i mtsi(dtIindB,l onctkh(rtehardesa,d I&drxi.nxg)-,> pgrreovu,p (&grrionugp-)>,n e x| t ^~~~~~~~~~~~~~~~~, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:s60e:n dnote: bfield 'group' will be initialized after field 'stepSize'u ff, a r562g | s - > r etcivdb(utfifd,) ,a rngtsh-r>eraeddsO(pnAtrhgr,e a0d,s )a,r gtsi-d>IcnoBnlnoIcnkd(etxh,r eaardgIsd-x>.cxo)n,n Ignrdoeuxp)(;g r o| u ^p ), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h ^~~~~~~~~~~: 80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' ndbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,562 | N C C L _tAiLdG(Ot_i#d#)a,l gnot,h rNeCaCdLs_(PnRtOhTrOe_a#d#sp)r,o ttoi>d(I)n.Brluonc(k&(ntchcrleSahdmIedmx..wxo)r,k )g;r o\u p (| g ^r oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 group(group: 60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 666562: | 9 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id(ti d666) | , n t h r e a dpsr(inmtsh(rteiadd,s )n,T htriedaIdnsBGlaotchke(rt,h rdeiardeIcdtx-.>xu)p,, gNrUoLuLp,( garrogusp-)>,s e n| d ^~~~~~~~~~~b uff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ut, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:(562n:c15c:l Swarning: hinitializer order does not match the declaration order [-Wreorder-ctor]m em.comm .562b | u f f S itzieds([tNiCdC)L,_ PnRtOhTrOe_aSdIsM(PnLtEh]r/eNaCdCsL)_,S TtEiPdSI/nsBilzoecokf((tTh)r)e a{d I d| x ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. x )| , group(group group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p655):,11 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 655 | 563 | s t e ppSriizmes((ntcicdl-SthimdeSmt.acrotmRme.dbuucfef,S inzTehsr[eNaCdCsLR_ePdRuOcTeO,_ SnIuMlPlLpEt]r/,N C&CdLi_rSeTcEtP-S>/osuitz,e oafr(gTs)-)> s{e n d| b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u f f| , group(group args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:c641v:b11u:f fnote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ims( t202i | d - t i d S t a rRtuRneWdourckeE,l enmTehnrte,d oPwrno,t o&>d(i)r.ercutn-(>woeu)t;, a| r ^g s->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppb:u4f:f1,: anote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereg s->r e4c | vIbMuPfLf_,C O L| L ^_ FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:R202e:d53u:c enote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here COL L202N | E T _ D I R E C TR,u nSWIoMrPkLEEl,e mMeanxt,< Finn,t 8T_,t )R e d| O^p , Alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:,391 :P95r:o tnote: oexpanded from macro 'IMPL_COLL_FUNC'> ().run (391w | e ) ;R u n| W ^o rk, 2, 2>::run' requested hereu nc, 4t | yIpMeP,L _FCuOnLcL#_#FdUeNvCr(eAdlolpR,, CNOCLCLLN_EATL_GDOI_R#E#CaTl,g oS,I MNPCLCEL,_ PMRaOxT,O _i#n#tp8r_ott)o > (| )^. run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c391c:l95S:h mnote: eexpanded from macro 'IMPL_COLL_FUNC'm .work); \391 | | ^R unWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:<562n:c15c:l Fnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n c##f u562n | c , t ytpied,( tFiudn)c,# #ndtehvrreeaddosp(a,d sN)C,C Lt_iAdLIGnOB_l#o#cakl(gtoh,r eNaCdCILd_xP.RxO)T,O _g#r#opurpo(tgor>o(u)p.)r,u n (| & ^~~~~~~~~~~~~~~~~n cc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m60e:m .note: wfield 'group' will be initialized after field 'stepSize'o rk); 562\ | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~a dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] In file included from 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 1 : In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d10(: tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:)167,: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]( nthreads), tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I n| B tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~b u f| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S izes[ N563C | C L _ P RsOtTeOp_SSiIzMeP(LnEc]c/lNSChCmLe_mS.TcEoPmSm/.sbiuzfefoSfi(zTe)s)[ N{C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P R O| T group(groupO _SIMPLE]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:E687P:S11/:s inote: zin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree of(T) )687 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group prims(tid-tidStart/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:c626a:s9t:, note: nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT hread s626B | c a s t , & d iprreicmts-(>toiudt-,t induSltlaprttrS,c aatrtgesr-,> sneTnhdrbeuafdfs,S caartgtse-r>,r eNcUvLbLu,f fd,i r e| c ^t ->up, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g202s:-53>:s enote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered buff ,202 | a r g s - > r e cRvubnuWfofr,k E l| e ^m ent, 2, 2>::run' requested here, Alg o202, | P r o t o > ( )R.urnuWno(rwkeE)l;e m e| n ^t , 2, 2>::run' requested herep , A l4g | oI,M PPLr_oCtOoL>L(_)F.UrNC(AllReducuen,( wCeO)L;L N E| T ^_ DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppS:I4M:P1L:E ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereM ax, i4n | tI8M_PtL)_ C O| L^L _FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:(391A:l95l:R enote: dexpanded from macro 'IMPL_COLL_FUNC'u ce, C O391L | L N ERTu_nDWIoRrEkC, NCCL _391A | L G OR_u#n#Waolrgko<,n cNcClCFLu_nPcR#O#TfOu_n#c#,p rtoytpoe>,( )F.urnucn#(#&dnecvcrleSdhompek,) ;N C\C L _| A ^L GO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:l562g:o15,: Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B lock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~d (ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:)562,: 60n:t hnote: rfield 'group' will be initialized after field 'stepSize'e ads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~d Idx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:)562,: 60g:r onote: ufield 'group' will be initialized after field 'stepSize'p (grou p562) | , | ^~~~~~~~~~~t id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d ), nthread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~d x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h note: expanded from macro 'IMPL_COLL_FUNC' :562: 15391: | warning: initializer order does not match the declaration order [-Wreorder-ctor]R unWork,, tNiCdCILn_BAlLoGcOk_(#t#harlegaod,I dNxC.CxL)_,P RgOrToOu_p#(#gprrooutpo)>,( ) .| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n (| & tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n cclSh m563e | m . w o rskt)e;p S\i z e| ( ^n cclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:c15o:m mnote: .field 'nthreads' will be initialized after field 'tidInBlock'b uffS i562z | e s [ N CtCiLd_(PtRiOdT)O,_ SnItMhPrLeEa]d/sN(CnCtLh_rSeTaEdPsS)/,s itziedoIfn(BTl)o)c k{( t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| I group(groupd x.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u655p:)11,: note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562655: | 60 : note: field 'group' will be initialized after field 'stepSize' 562p | r i m s (ttiidd(-ttiidd)S,t anrtthRreedaudcse(,n tnhTrheraedasd)s,R etdiudcIen,B lnouclkl(ptthrr,e a&ddIidrxe.cxt)-,> ogurto,u pa(rggrso-u>ps)e,n d b| u ^~~~~~~~~~~f f, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:n562t:815_:t )warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 391 : 95 : tnote: iexpanded from macro 'IMPL_COLL_FUNC'd (tid), n391t | h r eRaudnsW(onrtkhu,p )N,C C L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~A L G| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##al g563o | , N C CsLt_ePpRSOiTzOe_(#n#cpcrloSthom>e(m)..croumnm(.&bnucfcflSSihzmeesm[.NwCoCrLk_)P;R O\T O _| S ^I MPLE]/NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:S15T:E Pnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'/ sizeof( T562) | ) { t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ( t| i group(groupd ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r687e:a11d:s (note: nin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads) ,687 | t i d I n B l o c k (ptrhirmesa(dtIiddx-.txi)d,S tgarrotuBpc(agsrto,u pn)T,h r e| a ^~~~~~~~~~~~~~~~~d sBca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562,: 60&:d inote: rfield 'group' will be initialized after field 'stepSize'e ct->o u562t | , n u ltlipdt(rt,i da)r,g sn-t>hsreenaddbsu(fnft,h raeragdss-)>,r etcivdbIunfBfl,o c k| ( ^t hreadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:x202):,53 :g rnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu p(gro u202p | ) , | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ent().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdInB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~~~~~~~) , n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: (field 'group' will be initialized after field 'stepSize'n thre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a dIdx. x563) | , g r osutpe(pgSriozuep()n,c c l| S ^~~~~~~~~~~h mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgo:,562 :N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]P ROTO_##p r562o | t o > ( )t.irdu(nt(i&dn)c,c lnSthhmreema.dwso(rnkt)h;r e\a d s| ) ^, tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l15o:c knote: (field 'nthreads' will be initialized after field 'tidInBlock't hread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads )563, | t i d IsntBelpoScikz(et(hnrcecaldSIhdmxe.mx.)c,o mgmr.obuupf(fgSriozueps)[,N C C| L ^~~~~~~~~~~~~~~~~_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I60M:P Lnote: Efield 'group' will be initialized after field 'stepSize'] /NCCL_ S562T | E P S / stiizde(otfi(dT)),) n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ( group(groupn threads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t677i:d11I:n Bnote: lin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo ck(thr e677a | d I d x . x ) , g rporuipm(sg(rtoiudp-)t,i d S| t ^~~~~~~~~~~a rtBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s (warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~l ock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run( w562e | ) ; | t ^i d(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp,: 6n:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered s(n t6h | rIeMaPdLs_)C,O LtLi_dFIUnNBCl(oAclkl(Rtehdruecaed,I dCxO.LxL)N,E Tg_rDoIuRpE(CgTr,o uSpI)M,P L E| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ M a| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), int3 2563_ | t ) | s^t epSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e391(:n95c:c lnote: Sexpanded from macro 'IMPL_COLL_FUNC'h mem.co m391m | . b uRfufnSWiozreks<[nNcCcClLF_uPnRcO#T#Of_uSnIcM,P LtEy]p/eN,C CFLu_nScT#E#PdSe/vsriezdeoopf<(tTy)p)e >{, N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| A group(groupL GO_##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C677C:11: note: Lin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ PROTO_ #677# | p r o t o > ( ) . r upnr(i&mnsc(ctliSdh-mteimd.Swtoarrkt)B;c a\s t ,| ^n Threads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:c562a:s15t:, note: &field 'nthreads' will be initialized after field 'tidInBlock'd irect -562> | o u t , tdiidr(etcitd-)>,d onwtnh,r eaardgss(-n>tshernedabdusf)f,, tairdgIsn-B>lroecckv(btuhfrfe,a d I| d ^x .x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202(:g53r:o unote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , | ^~~~~~~~~~~~~~~~~202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : Rnote: ufield 'group' will be initialized after field 'stepSize'n WorkE l562e | m e n t s()),. rtuind(IwneB)l;o c k| ( ^t hreadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppx:.7x:)1,: gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up(g r7o | uIpM)P,L _ C| O ^~~~~~~~~~~L L_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~t idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d InBlock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~e ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s (nthrea d562s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e adIdx .563x | ) , g rsotuepp(Sgirzoeu(pn)c,c l S| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m e m| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c omm.bu f563f | S i z e ss[tNeCpCSLi_zPeR(OnTcOc_lSSIhMmPeLmE.]c/oNmCmC.Lb_uSfTfESPiSz/essi[zNeCoCfL(_TP)R)O T{O _ S| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| E group(group] /NCCL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:S641/:s11i:z enote: oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref (T)) { 641 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:(666t:i9d:- tnote: iin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered Star t666R | e d u c e , n Tphrriemasd(stRiedd,u cneT,h rdeiardescGta-t>hdeorw,n ,d i&rdeicrte-c>tu-p>,o uNtU,L La,r gasr-g>ss-e>nsdebnudfbfu,f fa,r gasr-g>sr-e>crvebcuvfbfu,f f ,| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202202::5353:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202202 | | RRuunnWWoorrkkEElleemmeenntt<().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ endbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ go, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :warning: 202initializer order does not match the declaration order [-Wreorder-ctor]: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562202 | | t i d ( tRiudn)W,o rnktEhlreemaednst(a(d)I.drxu.nx()w,e )g;r o u| p ^( group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp):,7 : 1| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 7 | I563M | P L _ C OsLtLe_pFSUiNzCe((AnlclcRleSdhumceem,. cCoOmLmL.NbEuTf_fDSIiRzEeCsT[,N CSCILM_PPLREO,T OM_aSxI,M PuLiEn]t/3N2C_CtL)_ S T| E^P S/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e391o:f95(:T )note: )expanded from macro 'IMPL_COLL_FUNC' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391| | group(group RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h<:n655c:c11l:F unote: nin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec ##func ,655 | t y p e , F u n c #p#rdiemvsr(etdiodp-a,r tNRCeCdLu_cAeL,G On_T#h#raelagdos,R eNdCuCcLe_,P RnOuTlOl_p#t#rp,r o&tdoi>r(e)c.tr-u>no(u&tn,c calrSghsm-e>ms.ewnodrbku)f;f ,\ a r| g ^s ->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hv:b562u:f15f:, note: field 'nthreads' will be initialized after field 'tidInBlock'| ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53t:i dnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret id), 202n | t h r e a d s ( nRtuhnrWeoardksE)l,e mteindtI((g)r.oruupn)(,w e )| ; ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppfield 'group' will be initialized after field 'stepSize': 7:1: note: 562in instantiation of member function 'RunWork, 2, 2>::run' requested here | t7i | dI(MtPiLd_)C,O LnLt_hFrUeNaCd(sA(lnltRherdeuacdes,) ,C OtLiLdNIEnTB_lDoIcRkE(CtTh,r eSaIdMIPdLxE.,x )M,a xg,r ouuipn(tg3r2o_utp)) , | ^| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,t iNdC(CtLi_dA)L,G On_t#h#raelagdos,( nNtChCrLe_aPdRsO)T,O _t#i#dpIrnoBtloo>c(k)(.trhurne(a&dnIcdcxl.Sxh)m,e mg.rwoourpk()g;r o\u p )| , ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 group(group: 60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t626i:d9(:t inote: din instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , nthr e626a | d s ( n t h r e apdrsi)m,s (ttiiddI-ntBildoSctka(rtthSrceaatdtIedrx,. xn)T,h rgeraoduspS(cgartotuepr),, N U| L ^~~~~~~~~~~L , direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d ), nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~. x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k60(:t hnote: rfield 'group' will be initialized after field 'stepSize'e adIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n threa d563s | ) , t isdtIenpBSliozcek((ntchcrleSahdmIedmx..cxo)m,m .gbruofufpS(igzreosu[pN)C,C L _| P ^~~~~~~~~~~R OTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | priLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thms(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork | , N C CtLi_dA(LtGiOd_)#,# anltghor,e aNdCsC(Ln_tPhRrOeTaOd_s#)#,p rtoitdoI>n(B)l.orcukn((t&hnrcecaldSIhdmxe.mx.)w,o rgkr)o;u p\( g r| o ^u p), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 : 15| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s tteipdS(itzied()n,c cnltShhrmeeamd.sc(onmtmh.rbeuafdfsS)i,z etsi[dNICnCBLl_oPcRkO(TtOh_rSeIaMdPILdEx]./xN)C,C Lg_rSoTuEpP(Sg/rsoiuzpe)o,f ( T| ) ^~~~~~~~~~~~~~~~~) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~60 : | note: group(groupfield 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 626 :t9i:d (note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d), nt h626r | e a d s ( n t h rperaidmss)(,t itdi-dtIindBSltoacrkt(StchartetaedrI,d xn.Txh)r,e agdrsoSucpa(tgtreoru,p )N,U L L| , ^~~~~~~~~~~ direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement(CL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :R15u:n Wwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]r kElements(()n.trhurne(awdes));, t| i ^d InBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp(:t8h:r1e:a dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hered x.x) ,8 | gIrMoPuLp_(CgOrLoLu_pF)U,N C (| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l l R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d uce, C563O | L L N E Ts_tDeIpRSEiCzTe,( nScIcMlPSLhEm,e mM.acxo,m mi.nbtu6f4f_Sti)z e s| [^N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:R391O:T95O:_ Snote: Iexpanded from macro 'IMPL_COLL_FUNC'M PLE]/ N391C | C L _RSuTnEWPoSr/ks, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep e>, N666C | C L _ A L G O _ #p#railmgso(,t iNdC,C Ln_TPhRrOeTaOd_s#G#aptrhoetro,> (d)i.rreucnt(-&>nucpc,l SNhUmLeLm,. waorrgks)-;> s\e n d| b ^u ff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s15-:> rnote: efield 'nthreads' will be initialized after field 'tidInBlock'c vbuf f562, | | ^ tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202n:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered s(nt h202r | e a d s ) , t iRduInnWBolrokcEkl(etmherneta().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnWork:<562n:c15c:l Fwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]n c##func, type, F562u | n c # # dteivdr(etdiodp)<,t ynpteh>r,e aNdCsC(Ln_tAhLrGeOa_d#s#)a,l gtoi,d INnCBClLo_cPkR(OtThOr_e#a#dpIrdoxt.ox>)(,) .grruonu(p&(ngcrcoluSph)m,e m .| w ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o r k| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T); \ | ^563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e15p:S inote: zfield 'nthreads' will be initialized after field 'tidInBlock'e (nccl S562h | m e m . ctoimdm(.tbiudf)f,S inztehsr[eNaCdCsL(_nPtRhOrTeOa_dSsI)M,P LtEi]d/INnCBClLo_cSkT(EtPhSr/esaidzIedoxf.(xT)),) g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( g r| o group(groupu p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hfield 'group' will be initialized after field 'stepSize': 626:9: note: 562in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ti d626( | t i d ) , n t hprreiamdss((tnitdh-rteiaddSst)a,r ttSicdaItntBelro,c kn(TthhrreeaaddsISdcxa.txt)e,r ,g rNoUuLpL(,g rdoiurpe)c,t - >| u ^~~~~~~~~~~p , args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ms(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h<:n562c:c15l:F uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]c ##func, ty p562e | , F u ntci#d#(dteivdr)e,d onpts,( nNtChCrLe_aAdLsG)O,_ #t#iadlIgnoB,l oNcCkC(Lt_hPrReOaTdOI_d#x#.pxr)o,t og>r(o)u.pr(ugnr(o&unpc)c,l S h| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m .| w tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rk); 563\ | | ^ stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hz:e562(:n15c:c lnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'h mem.co m562m | . b u f ftSiidz(etsi[dN)C,C Ln_tPhRrOeTaOd_sS(InMtPhLrEe]a/dNsC)C,L _tSiTdEIPnSB/lsoiczke(otfh(rTe)a)d I{d x .| x ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | g group(groupr oup(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,641 : 11| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :641562 | : 60 : note: field 'group' will be initialized after field 'stepSize' pr i562m | s ( t i dt-itdi(dtSitda)r,t Rnetdhurceea,d sn(TnhtrheraedasdRse)d,u ctei,d IdniBrleocctk-(>tdhorwena,d I&ddxi.rxe)c,t -g>roouutp,( garrogusp-)>,s e n| d ^~~~~~~~~~~b uff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]u p(group), 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbu| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds), ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~I nBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd Idx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( nthr e563a | d s ) , sttiedpISniBzleo(cnkc(ctlhSrhemaedmI.dcxo.mxm).,b ugfrfoSuipz(egsr[oNuCpC)L,_ P R| O ^~~~~~~~~~~T O_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562W:o15r:k Ewarning: linitializer order does not match the declaration order [-Wreorder-ctor]e ments(()n.trhurne(awdes));, t| i ^d InBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp(:t9h:r1e:a dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hered x.x), 9g | rIoMuPpL(_gCrOoLuLp_)F,U N C| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~A l l| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e duc e563, | C O L LsNtEeTp_SDiIzReE(CnTc,c lSSIhMmPeLmE.,c oMmamx.,b uufifnSti6z4e_st[)N C C| L^_ PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:S391I:M95P:L Enote: ]expanded from macro 'IMPL_COLL_FUNC'/ NCCL_ST E391P | S / sRiuzneWoofr(kT<)n)c c{l F u| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c # #| f group(groupu nc, typ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:,687 :F11u:n cnote: #in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# devr e687d | o p < t y p e > , NpCrCiLm_sA(LtGiOd_-#t#iadlSgtoa,r tNBCcCaLs_tP,R OnTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : warning: sinitializer order does not match the declaration order [-Wreorder-ctor]t epSize(n c562c | l S h m etmi.dc(otmimd.)b,u fnftShirzeeasd[sN(CnCtLh_rPeRaOdTsO)_,S ItMiPdLIEn]B/lNoCcCkL(_tShTrEePaSd/Isdixz.exo)f,( Tg)r)o u{p ( g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ) group(group, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:e641p:S11i:z enote: (in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren cclShm e641m | . c o m m . b u f f Spirziemss[(NtCiCdL-_tPiRdOSTtOa_rStIRMePdLuEc]e/,N CnCTLh_rSeTaEdPsSR/esdiuzceeo,f (dTi)r)e c{t - >| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o w n| , group(group &direct-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>:o626u:t9,: anote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg s->se n626d | b u f f , a r gpsr-i>mrse(ctvibdu-ftfi,d S t| a ^r tScatt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:r202,: 53n:T hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree adsS c202a | t t e r , N U LRLu,n WdoirrkeEclte-m>eunpt,< Fanr,g sT-,> sReenddObpu,f fA,l gaor,g sP-r>orteoc>v(b)u.frfu,n ( w| e ^) ; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppnote: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here8 :1: note: 202in instantiation of member function 'RunWork, 2, 2>::run' requested here | 8 | I MRPuLn_WCoOrLkLE_lFeUmNeCn(tAS(I)M.PrLuEn,( wMea)x;, i| n ^t 64_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :| 7^: 1: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWork, 2, 2>::run' requested here: 391:95: 7note: | expanded from macro 'IMPL_COLL_FUNC'I MPL_CO L391L | _ F URNuCn(WAolrlkR_,t )N C C| L^_ ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_391#:#95a:l gnote: oexpanded from macro 'IMPL_COLL_FUNC', NCCL_ P391R | O T OR_u#n#Wporroktc(c)l.Fruunnc(#&#nfcucnlcS,h mteymp.ew,o rFku)n;c #\# d e| v ^r edop:, note: Nfield 'nthreads' will be initialized after field 'tidInBlock'C CL_A L562G | O _ # # atligdo(,t iNdC)C,L _nPtRhOrTeOads(nthreads), tidInBlock(thread_I#d#xp.rxo)t,o >g(r)o.urpu(ng(r&onucpc)l,S h m| e ^~~~~~~~~~~~~~~~~m .w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:)60;: \note: field 'group' will be initialized after field 'stepSize' | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :t inote: dfield 'nthreads' will be initialized after field 'tidInBlock'( tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~o up(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs(:t562i:d15-:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]St artScatter, nT h562r | e a d s Stciadt(tteird,) ,N UnLtLh,r edaidrse(cntt-h>ruepa,d sa)r,g st-i>dsIennBdlboucfkf(,t harregasd-I>drxe.cxv)b,u fgfr,o u p| ( ^g roup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202 :| 53 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | s t eRpuSniWzoer(knEclcelmSehnmteO(_)S.IrMuPnL(Ew]e/)N;C C L| _ ^S TEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpps:i9z:e1o:f (note: Tin instantiation of member function 'RunWork, 2, 2>::run' requested here) ) { 9| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I M P| L group(group_ COLL_FUNC(AllRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:u655c:e11,: Cnote: Oin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL LNET_D I655R | E C T , S I M P L Ep,r iMmasx(,t iudi-ntti6d4S_tta)r t R| e^d uce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:T391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'R educe, 391n | u l lRputnrW,o r&kdnocu#t#,f uanrcg,s -t>yspeen,d bFuufnfc,# #adregvsr-e>droepc,, N| C ^C L_ALGO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:#202#:a53l:g onote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here NCCL _202P | R O T O _ # # p rRoutnoW>o(r)k.Erluenm(e&nntc562(:)15.:r unote: nfield 'nthreads' will be initialized after field 'tidInBlock'( we); 562 | | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppt:i9d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh re a9d | sI(MnPtLh_rCeOaLdLs_)F,U NtCi(dAIlnlBRleodcukc(et,h rCeOaLdLINdExT._xD)I,R EgCrTo,u pS(IgMrPoLuEp,) ,M a x| , ^~~~~~~~~~~~~~~~~ ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t5626:460_:t )note: field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95t:i dnote: (expanded from macro 'IMPL_COLL_FUNC't id), n t391h | r e aRdusn(Wnotrhkrp,) ,N C C| L ^~~~~~~~~~~_ ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here562 | 202t | i d ( t i d ) , RnutnWorkElemhernetah(r)e.arduInd(xw.ex));, g| r ^o up(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:p9):,1 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'RunWork, 2, 2>::run' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 9 | IM P563L | _ C O L Ls_tFeUpNSCi(zAel(lnRcecdluSchem,e mC.OcLoLmNmE.Tb_uDfIfRSEiCzTe,s [SNICMCPLL_EP,R OMTaOx_,S IuMiPnLtE6]4/_NtC)C L _| S^T EPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:z391e:o95f:( Tnote: )expanded from macro 'IMPL_COLL_FUNC') { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 | | group(group RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #func, 655t | y p e , F u n c # #pdreivmrse(dtoipd<-ttyipdeS>t,a rNtCRCeLd_uAcLeG,O _n#T#harlegaod,s RNeCdCuLc_eP,R OnTuOl_l#p#tprr,o t&od>i(r)e.crtu-n>(o&untc,c laSrhgmse-m>.sweonrdkb)u;f f\, a| r ^g s->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fnote: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202t:i53d:( tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered ), n t202h | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBr(o)u.pr)u,n ( w| e ^~~~~~~~~~~~~~~~~) ; /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :10:1: 562note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here t i10d | (ItMiPdL)_,C OnLtLh_rFeUaNdCs((AnltlhRreedaudcse),, CtOiLdLINnEBTl_oDcIkR(EtChTr,e aSdIIMdPxL.Ex,) ,M agxr,o uhpa(lgfr)o u p| )^, | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S15h:m ewarning: minitializer order does not match the declaration order [-Wreorder-ctor]. comm.buf f562S | i z e s [tNiCdC(Lt_iPdR)O,T On_tShIrMePaLdEs](/nNtChCrLe_aSdTsE)P,S /tsiidzIenoBfl(oTc)k)( t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I d| x group(group. x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:(641g:r11o:u pnote: )in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 641 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | 563 | p rsitmesp(Stiizde-(tnicdcSltSahrmteRme.dcuocmem,. bnuTfhfrSeiazdessR[eNdCuCcLe_,P RdOiTrOe_cStI-M>PdLoEw]n/,N C&CdLi_rSeTcEtP-S>/osuitz,e oafr(gTs)-)> s{e n d| b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u f f| , group(group args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:v666b:u9f:f ,note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 :p rnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem s(tid ,202 | n T h r e a d s GRautnhWeorr,k Edliermeecntt-<>Funp,, TN,U LRLe,d Oapr,g sA-lgo, Pr>osteon>d(b)u.frfu,n (awreg)s;- > r| e ^c vbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :| 10 ^: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 20210: | 53I:M Pnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ COLL _202F | U N C ( A l l R eRduuncWeo,r kCEOlLeLmNeEnTt_ ( )| .^r un(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:)391;: 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp: 10391: | 1 : Rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren Work <10n | cIcMlPFLu_nCcO#L#Lf_uFnUcN,C (tAylpleR,e dFuucnec,# #CdOeLvLrNeEdTo_pD,, SNICMCPLL_EA,L GMOa_x#,# ahlaglof,) N C| C^L _PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:O391_:#95#:p rnote: oexpanded from macro 'IMPL_COLL_FUNC't o>().r u391n | ( & nRcucnlWSohrmked,( tNiCdC)L,_ AnLtGhOr_e#a#dasl(gnot,h rNeCaCdLs_)P,R OtTiOd_I#n#Bplrooctko(>t(h)r.eraudnI(d&xn.cxc)l,S hgmreomu.pw(ogrrko)u;p )\, | | ^ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6015:: note: note: field 'group' will be initialized after field 'stepSize'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i dR)u,n WnotrhkrEelaedmse(nnttI(d)x..rxu)n,( wger)o;u p (| g ^r oup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~9 : 1| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 563 | 9 | I M PsLt_eCpOSLiLz_eF(UnNcCc(lASlhlmReemd.uccoem,m .CbOuLfLfNSEiTz_eDsI[RNECCCTL,_ PSRIOMTPOL_ES,I MMPaLxE,] /uNiCnCtL6_4S_TtE)P S /| s^i zeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:T391):)95 :{ note: expanded from macro 'IMPL_COLL_FUNC'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 391 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hW:o641r:k11<:n cnote: cin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel Func## f641u | n c , t y p e , Fpurnicm#s#(dteivdr-etdiodpSe,d uNcCeCL,_ AnLTGhOr_e#a#daslRgeod,u cNeC,C Ld_iPrReOcTtO-_>#d#opwrno,t o&>d(i)r.ercutn-(>&onuctc,l Sahrmgesm-.>wsoernkd)b;u f\f , | a ^r gs->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fnote: ,field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53(:t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , nth r202e | a d s ( n t h r eRaudnsW)o,r ktEildeImneBnltou(p)).,r u n| ( ^~~~~~~~~~~~~~~~~w e);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^60 : note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp: 8562: | 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested heret id(t i8d | )I,M PnLt_hCrOeLaLd_sF(UnNtCh(rAelaldRse)d,u ctei,d ICnOBLlLoNcEkT(_tDhIrReEaCdTI,d xS.IxM)P,L Eg,r oMuapx(,g rionutp6)4,_ t )| ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up), 563| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s tepSi z563e | ( n c c lsSthempeSmi.zceo(mnmc.cbluSfhfmSeimz.ecso[mNmC.CbLu_fPfRSOiTzOe_sS[INMCPCLLE_]P/RNOCTCOL__SSITMEPPLSE/]s/iNzCeCoLf_(STT)E)P S{/ s i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e o f| ( group(groupT )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 655in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | 655 | p r i m s (ptriidm-st(itdiSdt-atritdRSetdaurcteR,e dnuTcher,e andTshRreedaudcseR,e dduicree,c tn-u>ldlopwtnr,, &&ddiirreecctt-->>oouutt,, aarrggss-->>sseennddbbuuffff,, aarrggss-->>rreeccvvbbuuffff,, | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202202::5353:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202202 | | RRuunnWWoorrkkEElleemmeenntt<>(())..rruunn((wwee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp::1010::11:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 1010 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, MMaaxx,, hhaallff)) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h391::39195::95 :note: expanded from macro 'IMPL_COLL_FUNC'note: expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<r,e dNoCpCO,_ #N#CaClLg_oA,L GNOC_C#L#_aPlRgOoT,O _N#C#CpLr_oPtRoO>T(O)_.#r#upnr(o&tnoc>c(l)S.hrmuenm(.&wnocrckl)S;h m\e m .| w ^o rk); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock't id(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~o up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i60d:( tnote: ifield 'group' will be initialized after field 'stepSize'd ), nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~x ), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>15(:) .warning: rinitializer order does not match the declaration order [-Wreorder-ctor]u n(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd s), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thr e563a | d I d x .sxt)e,p Sgirzoeu(pn(cgcrloSuhpm)e,m . c| o ^~~~~~~~~~~~~~~~~m m.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g roup), | 562 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up(gr o563u | p ) , s| t ^~~~~~~~~~~e pSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhmem.:w562o:r15k:) ;warning: initializer order does not match the declaration order [-Wreorder-ctor]\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock't id(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up) ,563 | | ^~~~~~~~~~~~~~~~~ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:S562i:z60e:( nnote: cfield 'group' will be initialized after field 'stepSize'c lShmem .562c | o m m . btuifdf(Stiizde)s,[ NnCtChLr_ePaRdOsT(On_tShIrMePaLdEs])/,N CtCiLd_ISnTBElPoSc/ks(itzheroefa(dTI)d)x .{x ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp (group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : group(group15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 626t:i9d:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nthr e626a | d s ( n t h r e apdrsi)m,s (ttiiddI-ntBildoSctka(rtthSrceaatdtIedrx,. xn)T,h rgeraoduspS(cgartotuepr),, N U| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L , | d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i rect -563> | u p , asrtgesp-S>iszeen(dnbcucflfS,h maermg.sc-o>mrme.cbvubfuffSfi,z e s| [ ^N CCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:_202S:I53M:P Lnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here] /NCC L202_ | S T E P S / s i zReuonfW(oTr)k)E l{e m e| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t < F| n group(group, T, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:p677,: 11A:l gnote: oin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, Prot o677> | ( ) . r u n ( w e ) ;p r i| m ^s (tid-ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppd:S11t:a1r:t Bnote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herea st, n11T | hIrMePaLd_sCBOcLaLs_tF,U N&Cd(iArlelcRte-d>uocuet,, CdOiLrLeNcEtT-_>DdIoRwEnC,T ,a rSgIsM-P>LsEe,n dMbauxf,f ,f laoragts)- > r| e^c vbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :39153 | : note: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu nWor k202< | n c c l F u n c #R#ufnuWnocr,k Etlyepmee,n tF(>),. rNuCnC(Lw_eA)L;G O _| # ^# algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppN:C9C:L1_:P Rnote: Oin instantiation of member function 'RunWork, 2, 2>::run' requested hereT O_# #9p | rIoMtPoL>_(C)O.LrLu_nF(U&NnCc(cAllSlhRmeedmu.cweo,r kC)O;L L\N E T| _ ^D IRECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :S562I:M15P:L Enote: ,field 'nthreads' will be initialized after field 'tidInBlock' Max, u562i | n t 6 4 _tti)d ( t| i^d ), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: (expanded from macro 'IMPL_COLL_FUNC'n threa d391s | ) , RtuindWIonrBkl:,60 :N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _ALGO _562# | # a l g o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht,:i 562dN:C(15Ct:Li _dwarning: P)initializer order does not match the declaration order [-Wreorder-ctor],R O TnOt_h#r #e562ap | dr so (t no t>th(ir)d.e(ratudinsd()),&, n tcnictdlhISrnheBmaledomsc.(kwn(ottrhhkrr)ee;aa dd\Isd )x ,.| x ^)t ,i dgIr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hno:Bu562lp:o(15cg:kr (onote: tufield 'nthreads' will be initialized after field 'tidInBlock'hp r)e,a d I562| d | ^~~~~~~~~~~x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads) ,563 | t i d I nsBtleopcSki(zteh(rnecacdlISdhxm.exm).,c ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~~~~~~~L _PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_60S:I Mnote: Pfield 'group' will be initialized after field 'stepSize'L E]/NC C562L | _ S T E PtSi/ds(itziedo)f,( Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n t| h group(groupr eads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:n666B:l9o:c knote: (in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hread I666d | x . x ) , g r opurpi(mgsr(otuipd),, n T| h ^~~~~~~~~~~r eadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->recvb u562f | f , | t ^i d(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t202h:r53e:a dnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( nthr e202a | d s ) , t i d IRnuBnlWoocrkk(EtlhermeeandtI| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) .run( w563e | ) ; | s ^t epSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppc:c11l:S1h:m enote: min instantiation of member function 'RunWork, 2, 2>::run' requested here. comm. b11u | fIfMSPiLz_eCsO[LNLC_CFLU_NPCR(OATlOl_RSeIdMuPcLeE,] /CNOCLCLLN_ESTT_EDPISR/EsCiTz,e oSfI(MTP)L)E ,{ M a| x ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, f| l group(groupo at) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::11391:: 95note: :in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here note: expanded from macro 'IMPL_COLL_FUNC' 655 | 391 | R u n Wporrikmp,t rN,C C&Ld_iArLeGcOt_-#>#oaultg,o ,a rNgCsC-L>_sPeRnOdTbOu_f#f#,p raortgos>-(>)r.ercuvnb(u&fnfc,c l S| h ^m em.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 :\53 : | note: ^in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho :r562562k | :E 15l :e m warning: etinitializer order does not match the declaration order [-Wreorder-ctor]ni td<(Ftni,d )T562,, | nR te hd rOtepia,dd (sAt(lingtdoh),r, e Pandrtsoh)tr,oe >ta(id)ds.I(rnnuBtnlh(orwceeka)(d;ts h) r,| e ^at diIddIxn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.B:xl10)o:,c1 k:g( rtnote: ohin instantiation of member function 'RunWork, 2, 2>::run' requested hereur pe(ag dr10Io | duIxpM.)P,xL )_ ,C| O ^~~~~~~~~~~~~~~~~gL rLo_uFp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hU(:Ng562C:r(60oA:ul plnote: )Rfield 'group' will be initialized after field 'stepSize',e d u| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e , 562 | | C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) O L L Nt Ei563Td | _( Dt Ii Rd E)sC,tT e,np tSShiIrzMeePa(dLnsEc(,cn ltMShahrxme,ea mdh.sac)lo,fm )mt .i bd| uI^fn fBSl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hio:zc391ek:s(95[t:Nh Crnote: Ceexpanded from macro 'IMPL_COLL_FUNC'La d_IPdRxO .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : warning: sinitializer order does not match the declaration order [-Wreorder-ctor]t epSize(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641 :s11t:e pnote: Sin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei ze(nccl S641h | m e m . c o m m . b upfrfiSmisz(etsi[dN-CtCiLd_SPtRaOrTtOR_eSdIuMcPeL,E ]n/TNhCrCeLa_dSsTREePdSu/csei,z edoifr(eTc)t)- >{d o w| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, &| d group(groupi rect->out, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h-:>641s:e11n:d bnote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref f, a r641g | s - > r e c v b u f fp,r i m| s ^( tid-tidStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:R202e:d53u:c enote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here nThrea d202s | R e d u c e , dRiurneWcotr-k>Edloewmne,n t&RoeudtO,p ,a rAglsg-o>,s ePnrdobtuof>f(,) .arrugns(-w>er)e;c v b| u ^f f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::1202:: 53note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 10 | IMP202L | _ C O L L _ F U NRCu(nAWlolrRkeEdluecmee,n tC,( )h.arlufn)( w e| )^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp95::11 :note: 1expanded from macro 'IMPL_COLL_FUNC': note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 39111 | | I MRPuLn_WCoOrLkL<_nFcUcNlCF(uAnlcl#R#efduuncce,, tCyOpLeL,N EFTu_nDcI#R#EdCeTv,r eSdIoMpPa,x ,N CfClLo_aAtL)G O _| #^# alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:,391 :N95C:C Lnote: _expanded from macro 'IMPL_COLL_FUNC'P ROTO_ #391# | p r oRtuon>W(o)r.krfield 'nthreads' will be initialized after field 'tidInBlock', NCCL_ A562L | G O _ # #taildg(ot,i dN)C,C Ln_tPhRrOeTaOd_s#(#nptrhorteoa>d(s)).,r utni(d&InncBclloSchkm(etmh.rweoardkI)d;x .\x ) ,| ^g roup(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,note: field 'nthreads' will be initialized after field 'tidInBlock' | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(tehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~, g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60(:g rnote: ofield 'group' will be initialized after field 'stepSize'u p), 562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),202 gro:u53p:( gnote: roin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu p), | ^~~~~~~~~~~~~~~~~ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k Element <562F | n , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(on>t(h)r.eraudns()w,e )t;i d I| n ^B lock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :g11r:o1u:p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup) ,11 | I| M ^~~~~~~~~~~P L_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]) , nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. x )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) group( g563r | o u p ) ,s t e| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S i z| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( nccl S563h | m e m . csotmemp.Sbiuzfef(SniczcelsS[hNmCeCmL._cPoRmOmT.Ob_uSfIfMSPiLzEe]s/[NNCCCCLL__SPTREOPTSO/_sSiIzMePoLfE(]T/)N)C C{L _ S| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E P S| / group(groups izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655| : group(group11 : note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 655655: | 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655p | r i m s ( t i d - t ipdrSitmasr(ttRiedd-utcied,S tnaTrhtrReeadduscRee,d uncTeh,r enaudlslRpetdru,c e&,d inruelcltp-t>ro,u t&,d iarregcst-->>soeuntd,b uafrfg,s -a>rsgesn-d>bruefcfv,b uafrfg,s - >| r ^e cvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^202 :53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 202:53: note: 202in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | 202 | R u n W o rRkuEnlWeomreknEtlP(r)o.trou>n(()w.er)u;n ( w| e ^) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp10::111::1 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 1011 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, MMaaxx,, hfallofa)t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<, NCCL_A#,a lNgCoC,L _NACLCGLO__P#R#OaTlOg_o#,# pNrCoCtLo_>P(R)O.TrOu_n#(#&pnrcoctloS>h(m)e.mr.uwno(r&kn)c;c l\S h m| e ^m .wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:)562;: 15\: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~, g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60(:g rnote: ofield 'group' will be initialized after field 'stepSize'u p), 562| | ^~~~~~~~~~~~~~~~~ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i60d:) ,note: field 'group' will be initialized after field 'stepSize'n thread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~d x.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:LE562,: 15M:a xwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] float) | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : tid(t391i:d95):, note: nexpanded from macro 'IMPL_COLL_FUNC't hreads (391n | t h rReuandWso)r,k , | N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_AL G563O | _ # # a lsgtoe,p SNiCzCeL(_nPcRcOlTSOh_m#e#mp.rcootmom>.(b)u.frfuSni(z&ensc[cNlCSChLm_ePmR.OwToOr_kS)I;M P\L E ]| / ^N CCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:E562P:S15/:s inote: zfield 'nthreads' will be initialized after field 'tidInBlock'e of(T) )562 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ( group(groupt id), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n655t:h11r:e anote: din instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres ), tid I655n | B l o c k ( t h r e apdrIidmxs.(xt)i,d -gtrioduSpt(agrrtoRuepd)u,c e ,| ^~~~~~~~~~~~~~~~~n Thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60R:e dnote: ufield 'group' will be initialized after field 'stepSize'c e, nul l562p | t r , &tdiidr(etcitd-)>,o untt,h raeragdss-(>nstehnrdebaudfsf),, atrigdsI-n>Brleoccvkb(utfhfr,e a d| I ^d x.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o202u:p53(:g rnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu p), 202| | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ecvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15&:n cwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]l Shmem.wo r562k | ) ; \ t i| d ^( tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s (note: nfield 'nthreads' will be initialized after field 'tidInBlock't hreads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c k (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h readI d563x | . x ) , sgtreopuSpi(zger(onucpc)l,S h m| e ^~~~~~~~~~~~~~~~~m .co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:m562.:b60u:f fnote: Sfield 'group' will be initialized after field 'stepSize'i zes[N C562C | L _ P R OtTiOd_(StIiMdP)L,E ]n/tNhCrCeLa_dSsT(EnPtSh/rseiazdeso)f,( Tt)i)d I{n B l| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c k (| t group(grouph readIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)641,: 11g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (group )641, | | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562202: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWor k562E | l e m e ntti,( )t.irduInn(Bwleo)c;k ( t| h ^r eadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.:x11):,1 :g rnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested hereu p(gr o11u | pI)M,P L _| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O L L| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)F UNC(A l563l | R e d u cset,e pCSOiLzLeN(EnTc_cDlISRhEmCeTm,. cSoImMmP.LbEu,f fMSaixz,e sf[lNoCaCtL)_ P R| O^T O_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P391L:E95]:/ Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'C L_STEP S391/ | s i zReuonfW(oTr)k)< n{c c l| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u n c| # group(group# func, ty/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:e655,: 11F:u nnote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #devre d655o | p < t y p e > , N CpCrLi_mAsL(GtOi_d#-#taildgSot,a rNtCRCeLd_uPcReO,T On_T#h#rperaodtsoR>e(d)u.creu,n (n&unlclcpltSrh,m e&md.iwroerckt)-;> o\u t ,| ^a rgs-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:s562e:n15d:b unote: ffield 'nthreads' will be initialized after field 'tidInBlock'f , arg s562- | > r e c vtbiudf(ft,i d )| , ^ nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202(:n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds), 202t | i d I n B l o c kR(utnhWroerakdEIldexm.exn)t,< Fgnr,o uTp,( gRreoduOpp),, A l| g ^~~~~~~~~~~~~~~~~o , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:r562o:t60o:> (note: )field 'group' will be initialized after field 'stepSize'. run(w e562) | ; | ^t id(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppd:)12,: 1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads( n12t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ICdOxL.LxN)E,T _gDrIoRuEpC(Tg,r oSuIpM)P,L E ,| ^~~~~~~~~~~M ax, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup), | ^~~~~~~~~~~~~~~~~562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~p Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t id), nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~. x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o60u:p (note: gfield 'group' will be initialized after field 'stepSize'r oup), 562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(StihzreesadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p S i| z tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e (nccl S563h | m e m . csotmemp.Sbiuzfef(SniczcelsS[hNmCeCmL._cPoRmOmT.Ob_uSfIfMSPiLzEe]s/[NNCCCCLL__SPTREOPTSO/_sSiIzMePoLfE(]T/)N)C C{L _ S| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E P S| / group(groups izeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~626 : 9| : group(group note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 687 : 11 : pnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei ms(tid -687t | i d S t a r t S c a tptreirm,s (ntTihdr-etaiddsSStcaartttBecra,s tN,U LnLT,h rdeiardescBtc-a>sutp,, &adrigrse-c>ts-e>nodubtu,f fn,u lalrpgtsr-,> raercgvsb-u>fsfe,n d b| u ^f f, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h-:>202r:e53c:v bnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref f, | 202 ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202R:u53n:W onote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek Elem e202n | t < F n , T , RRuendWOopr,k EAllegmoe,n tPT(,) .RreudnO(pw,e )A;l g o| , ^ Proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp(:)11.:r1u:n (note: win instantiation of member function 'RunWork, 2, 2>::run' requested heree ); | 11 ^ | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppC:O11L:L1_:F Unote: Nin instantiation of member function 'RunWork, 2, 2>::run' requested hereC (All R11e | dIuMcPeL,_ CCOOLLLL_NFEUTN_CD(IARlElCRTe,d uScIeM,P LCEO,L LMNaExT,_ DfIlRoEaCtT), S| I^M PLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:a391x:,95 :f lnote: oexpanded from macro 'IMPL_COLL_FUNC'a t) | ^ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 :R95u:n Wnote: oexpanded from macro 'IMPL_COLL_FUNC'r k#,# dNeCvCrLe_dAoLpGl,g oN,C CNLC_CALL_GPOR_O#T#Oa_l#g#op,r oNtCoC>L(_)P.RrOuTnO(_&#n#cpcrloSthom>e(m)..wrournk()&;n c\c l S| h ^m em.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:)562;: 15\: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~p (gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,60 : | note: ^~~~~~~~~~~~~~~~~field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60 :562 | note: field 'group' will be initialized after field 'stepSize' t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563563 | | sstteeppSSiizzee((nnccccllSShhmmeemm..ccoommmm..bbuuffffSSiizzeess[[NNCCCCLL__PPRROOTTOO__SSIIMMPPLLEE]]//NNCCCCLL__SSTTEEPPSS//ssiizzeeooff((TT)))) {{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::641666::119:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666641 | | p r ipmrsi(mtsi(dt,i dn-TthirdeSatdasrGtaRtehdeurc,e ,d inrTehcrte-a>duspR,e dNuUcLeL,, dairrgesc-t>-s>ednodwbnu,f f&,d iarregcst-->>roeuctv,b uafrfg,s - >| s ^e ndbuff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:s202-:>53r:e cnote: vin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereb uff, 202 | | ^ R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202W:o53r:k Enote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ment <202F | n , T , R e dROupn,W oArlkgEol,e mPernottn(,) .Tr,u nR(ewdeO)p;, A| l ^g o, Proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp>:(12):.1r:u nnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested herew e); 12| | ^I MPL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppL:_12F:U1N:C (note: Ain instantiation of member function 'RunWork, 2, 2>::run' requested herel lRed u12c | eI,M PCLO_LCLONLELT__FDUINRCE(CATl,l RSeIdMuPcLeE,, CMOaLxL,N EdTo_uDbIlReE)C T ,| ^S IMPLE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :M391a:x95,: dnote: oexpanded from macro 'IMPL_COLL_FUNC'u ble) | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:u391n:W95o:r knote: u,n cN#C#CdLe_vArLeGdOo_p#<#taylpgeo>,, NNCCCCLL__PARLOGTOO__####aplrgoot,o >N(C)C.Lr_uPnR(O&TnOc_c#l#Sphrmoetmo.>w(o)r.kr)u;n (\& n c| c ^l Shmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:o562r:k15):; note: \field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dnote: (field 'nthreads' will be initialized after field 'tidInBlock't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 :t inote: dfield 'group' will be initialized after field 'stepSize'( tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~r oup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 563 | : 562 :s15t:e pwarning: Sinitializer order does not match the declaration order [-Wreorder-ctor]i ze(ncclShme m562. | c o m m .tbiudf(ftSiidz)e,s [nNtChCrLe_aPdRsO(TnOt_hSrIeMaPdLsE)],/ NtCiCdLI_nSBTlEoPcSk/(stihzreeoafd(ITd)x). x{) , | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 677 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 11 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 677 | s t e p S i z ep(rnicmcsl(Sthimde-mt.icdoSmtma.rbtuBfcfaSsitz,e sn[TNhCrCeLa_dPsRBOcTaOs_tS,I M&PdLiEr]e/cNtC-C>Lo_uStT,E PdSi/rseiczte-o>fd(oTw)n), {a r g| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- > s| e group(groupn dbuff, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>:r641e:c11v:b unote: fin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref , | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ims( t202i | d - t i d S t a rRtuRneWdourckeE,l enmTehnrte,d oPwrno,t o&>d(i)r.ercutn-(>woeu)t;, a| r ^g s->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppd:b12u:f1f:, note: ain instantiation of member function 'RunWork, 2, 2>::run' requested herer gs-> r12e | cIvMbPuLf_fC,O L L| _ ^F UNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:d202u:c53e:, note: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereO LLNE T202_ | D I R E C T , SRIuMnPWLoEr,k EMlaexm,e ndto( )391. | r u nR(uwneW)o;r k <| n ^c clFunc#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp#:f12u:n1c:, note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herey pe, F12u | nIcM#P#Ld_eCvOrLeLd_oFpUl,R eNdCuCcLe_,A LCGOOL_L#N#EaTl_gDoI,R ENCCTC,L _SPIRMOPTLOE_,# #Mparxo,t od>o(u)b.lreu)n ( &| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:P562S:/15s:i zwarning: einitializer order does not match the declaration order [-Wreorder-ctor]o f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562 | | group(group tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:)687,: 11n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads(nt h687r | e a d s ) , t i d IpnrBilmosc(kt(itdh-rteiaddSItdaxr.txB)c,a sgtr,o unpT(hgrreoaudps)B,c a s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, &| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i rect- >563o | u t , nsutlelppStirz,e (anrcgcsl-S>hsmeenmd.bcuofmfm,. baurfgfsS-i>zreesc[vNbCuCfLf_,P R O| T ^O _SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:]202/:N53:C Cnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ STEP S202/ | s i z e o f ( T )R)u n{W o r| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E l e| m group(groupe nt, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel go, Pr o677t | o > ( ) . r u n ( w ep)r;i m s| ( ^t id-tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppS:t12a:r1t:B cnote: ain instantiation of member function 'RunWork, 2, 2>::run' requested heres t, n12T | hIrMePaLd_sCBOcLaLs_tF,U N&Cd(iArlelcRte-d>uocuet,, CdOiLrLeNcEtT-_>DdIoRwEnC,T ,a rSgIsM-P>LsEe,n dMbauxf,f ,d oaurbglse-)> r e| c^v buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202391: | 53 : Rnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Work <202n | c c l F u n c # #RfuunnWco,r ktEylpeem,e nFtuo,, NPCrCoLt_oA>L(G)O._r#u#na(lwgeo),; N C| C ^L _PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp#:#10p:r1o:t onote: >in instantiation of member function 'RunWork, 2, 2>::run' requested here( ).ru n10( | &InMcPcLl_SChOmLeLm_.FwUoNrCk()A;l l\R e d| u ^c e, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:L562L:N15E:T _note: Dfield 'nthreads' will be initialized after field 'tidInBlock'I RECT ,562 | S I M P LtEi,d (Mtaixd,) ,h anltfh)r e a| d^s (nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBloc k391( | t h rReuandWIodrxk. , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ evredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C15C:L _warning: Pinitializer order does not match the declaration order [-Wreorder-ctor]R OTO_##p r562o | t o > ( )t.irdu(nt(i&dn)c,c lnSthhmreeads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)15,: nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~a dIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 15 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): warning: initializer order does not match the declaration order [-Wreorder-ctor] 563 | st e562p | S i z e (tnicdc(ltSihdm)e,m .nctohmrme.abdusf(fnStihzreesa[dNsC)C,L _tPiRdOITnOB_lSoIcMkP(LtEh]r/eNaCdCILd_xS.TxE)P,S /gsriozuepo(fg(rTo)u)p ){, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | group(group tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 666 :s9t:e pnote: Sin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei ze(n c666c | l S h m e m . c opmrmi.mbsu(ftfiSdi,z ensT[hNrCeCaLd_sPGRaOtThOe_rS,I MdPiLrEe]c/tN-C>CuLp_,S TNEUPLSL/,s iazregosf-(>Ts)e)n d{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->recvbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:f687,: 11 :| ^note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202687: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s ( t i dR-utniWdoSrtkaErlteBmceanstt<,F nn,T hTr,e aRdesdBOcpa,s tA,l g&od,i rPercott-o>>o(u)t.,r unnu(lwlep)t;r , | a ^r gs->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:n13d:b1u:f fnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here args -13> | rIeMcPvLb_uCfOfL,L _ F| U ^N C(AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:c202e:,53 :C Onote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL NET_D I202R | E C T , S I M PRLuEn,W oMrakxE,l erccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hwarning: :unused variable 'flag2' [-Wunused-variable] 386:9: 153warning: | variable 'wireOffset' set but not used [-Wunused-but-set-variable] u i386n | t 3 2 _ ti ndta twai1r,e Offlfasge1t, =d aWtiar2e,W ofrldaPge2r;S l i| c ^~~~~e *warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uintIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]) , tidInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads) ,563 | t i d I nsBtleopcSki(zteh(rnecacdlISdhxm.exm).,c ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~L _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>(ment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o)u.pr(ugnr(o&unpc)c,l S h| m ^~~~~~~~~~~~~~~~~e m.work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, fIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ lag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->reIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: dOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 7 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork10,: In file included from N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hC:C167L: _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O15_:# #warning: ainitializer order does not match the declaration order [-Wreorder-ctor]l go, NCCL_PROTO _562# | # p r o ttoi>d(()t.irdu)n,( &nntchcrleSahdmse(mn.twhorreka)d;s )\, t| i ^d InBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'd x.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds(n t563h | r e a d ss)t,e ptSiidzIen(BnlcocclkS(htmherme.acdoImdmx..bxu)f,f Sgirzoeusp[(NgCrCoLu_pP)R,O T O| _ ^~~~~~~~~~~~~~~~~S IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:]562/:N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'S TEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:B626l:o9c:k (note: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh readI d626x | . x ) , g r o uppr(igmrso(utpi)d,- t i| d ^~~~~~~~~~~S tartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(&:n562c:c15l:S hwarning: minitializer order does not match the declaration order [-Wreorder-ctor]e m.work); 562\ | | ^ tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds(nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I dx. x563) | , g r osutpe(pgSriozuep()n,c c l| S ^~~~~~~~~~~~~~~~~h mem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:m562m:.60b:u fnote: ffield 'group' will be initialized after field 'stepSize'S izes[N C562C | L _ P R OtTiOd_(StIiMdPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] NCCL_ALGO _562# | # a l g ot,i dN(CtCiLd_)P,R OnTtOh_r#e#apdrso(tnot>h(r)e.ardusn)(,& ntcicdlISnhBmleomc.kw(otrhkr)e;a d\I d x| . ^x ), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r onote: ufield 'nthreads' will be initialized after field 'tidInBlock'p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(Stihzreesa[dNICdCxL._xP)R,O TgOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~~~~~~~T EP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:/562s:i60z:e onote: ffield 'group' will be initialized after field 'stepSize'( T)) { 562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | t group(groupi d(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e677a:d11s:( nnote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh r 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads)In file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :t1i: dIn file included from I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:B10l: oIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hk:(167t: h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15I:d xwarning: .initializer order does not match the declaration order [-Wreorder-ctor]x ), group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) nthrea d563s | ( n t h rsetaedpsS)i,z et(indcIcnlBSlhomcekm(.tchormema.dbIudfxf.Sxi)z,e sg[rNoCuCpL(_gPrRoOuTpO)_,S I M| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L E ]| / tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)N CCL_ S563T | E P S / ssitzeepoSfi(zTe)()n c{c l S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| . group(groupc omm.buffSizes[NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:T655O:_11S:I Mnote: Pin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL E]/NCCL_ S655T | E P S / s i z e o f (pTr)i)m s{( t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- t i| d group(groupS tartReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:T626h:r9e:a dnote: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereR educ e626, | n u l l p t r ,p r&idmisr(etcitd-->toiudtS,t aarrtgSsc-a>tsteenrd,b unfTfh,r eaardgssS-c>artetcevrb,u fNfU,L L ,| ^d irect->up,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a202r:g53s:- >note: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ndbuf f202, | a r g s - > r eRcuvnbWuofrfk,E l e| m ^e nt, 2, 2>::run' requested hered Op, 202A | l g o , P r o tRou>n(W)o.rrkuEnl(ewmee)n;t < F| n ^, T, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppO:p5,: 1A:l gnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested here, Prot o5> | (I)M.PrLu_nC(OwLeL)_;F U N| C ^( AllRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:u4c:e1,: Cnote: Oin instantiation of member function 'RunWork, 2, 2>::run' requested hereL LNE T4_ | DIIMRPELC_TC,O LSLI_MFPULNEC,( APlrlReduceeM,u lCSOuLmL,N EuTi_nDtI8R_EtC)T , | S^I MPLE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :P391r:e95M:u lnote: Sexpanded from macro 'IMPL_COLL_FUNC'u m, int8 _391t | ) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391u:n95W:o rnote: kexpanded from macro 'IMPL_COLL_FUNC'< ncclF u391n | c # #RfuunnWco,r ktn,c #N#CdCeLv_rAeLdGoOp_<#t#yapleg>o,, NNCCCCLL__APLRGOOT_O#_##a#lpgroo,t oN>C(C)L._rPuRnO(T&On_c#c#lpSrhomteom>.(w)o.rrku)n;( &\n c c| l ^S hmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:)562;: 15\: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~p (gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,60 : | note: ^~~~~~~~~~~~~~~~~field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60562: | note: field 'group' will be initialized after field 'stepSize' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~r oup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NcCCL_PRcOlTSOh_m#e#mp.rwork); o\t o >| ( ^) .run(&ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:w15o:r knote: )field 'nthreads' will be initialized after field 'tidInBlock'; \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement:( )warning: .initializer order does not match the declaration order [-Wreorder-ctor]r un(we); | 562 ^ | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:(4t:i1d:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested heren threa d4s | (InMtPhLr_eCaOdLsL)_,F UtNiCd(IAnlBllRoecdku(cteh,r eCaOdLILdNxE.Tx_)D,I RgErCoTu,p (SgIrMoPuLpE),, P r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~M u l| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u m, i n563t | 8 _ t ) s t| e^p Size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391c:c95l:S hnote: mexpanded from macro 'IMPL_COLL_FUNC'e m.comm .391b | u f fRSuinzWeosr[kN, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk); \ : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s, )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60563: | note: field 'group' will be initialized after field 'stepSize' step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 5624: | 15I:M Pwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ COLL_FU N562C | ( A l l Rteiddu(ctei,d )C,O LnLtNhErTe_aDdIsR(EnCtTh,r eSaIdMsP)L,E ,t iPdrIenMBulloScukm(,t hirneta8d_Itd)x . x| )^, group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:g391r:o95u:p )note: ,expanded from macro 'IMPL_COLL_FUNC' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | R u563n | W o r k T,O _NSCICMLP_LAEL]G/ON_C#C#La_lSgToE,P SN/CsCiLz_ePoRfO(TTO)_)# #{p r o| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o > (| ) group(group. run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l626S:h9m:e mnote: .in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herew ork); 626\ | | ^ p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562m:s15(:t inote: dfield 'nthreads' will be initialized after field 'tidInBlock'- tidSt a562r | t S c a tttiedr(,t indT)h,r enatdhsrSecaadtst(enrt,h rNeUaLdLs,) ,d itriedcItn-B>luopc,k (atrhgrse-a>dsIednxd.bxu)f,f ,g raorugps(-g>rroeucpv)b,u f f| , ^~~~~~~~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202field 'group' will be initialized after field 'stepSize': 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i d ) ,R unntWhorrekaEdlse(mnetnhtrx(.)x.)r,u ng(rwoeu)p;( g r| o ^u p), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562::15562:: 15note: :field 'nthreads' will be initialized after field 'tidInBlock' warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid( t562i | d ) , tid (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~~~~~~~p ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 60 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: field 'group' will be initialized after field 'stepSize' 563562 | | sttiedp(Stiizde)(,n cnctlhSrhemaedms.(cnotmhmr.ebaudfsf)S,i zteisd[INnCBClLo_cPkR(OtThOr_eSaIdMIPdLxE.]x/)N,C CgLr_oSuTpE(PgSr/osuipz)e,o f (| T ^~~~~~~~~~~) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>15s:e nwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]b uff, arg s562- | > r e c vtbiudf(ft,i d )| , ^ nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ), t i202d | I n B l o c k ( tRhurneWaodrIkdExl.exm)e,n tg563( | ) . r u ns(tweep)S;i z e| ( ^n cclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppm:.4c:o1m:m .note: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ffSi z4e | sI[MNPCLC_LC_OPLRLO_TFOU_NSCI(MAPlLlER]e/dNuCcCeL,_ SCTOELPLSN/EsTi_zDeIoRfE(CTT),) S{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E , | P group(groupr eMulSum,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :i687n:t118:_ tnote: )in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO _##pr o t o > ( ) .prruinm(s&(ntcicdl-SthimdeSmt.awrotrBkc)a;s t\, n| T ^h readsBca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562,: 15&:d inote: rfield 'nthreads' will be initialized after field 'tidInBlock'e ct->ou t562, | n u l ltpitdr(,t iadr)g,s -n>tshernedabdusf(fn,t harregasd-s>)r,e ctvibduIfnfB,l o c| k ^( threadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o202u:p53(:g rnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu p), | 202 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60R:u nnote: Wfield 'group' will be initialized after field 'stepSize'o rkEle m562e | n t < F nt,i dT(,t iRde)d,O pn,t hArlegaod,s (Pnrtohtroe>a(d)s.)r,u nt(iwdeI)n;B l o| c ^k (thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppI:d4x:.1x:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereg roup (4g | rIoMuPpL)_,C O L| L ^~~~~~~~~~~_ FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s (nthread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~k (th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60I:d xnote: .field 'group' will be initialized after field 'stepSize'x ), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads (563n | t h r e asdtse)p,S itzied(InncBclloSchkm(etmh.rceoamdmI.dbxu.fxf)S,i zgerso[uNpC(CgLr_oPuRpO)T,O _ S| I ^~~~~~~~~~~M PLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^~~~~~~~~~~~~~~~~562 :15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :60: note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht->o:u562t:,15 :a rwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]s ->sendbuff, a r562g | s - > r etcivdb(utfifd,) , | n ^t hreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tidI n202B | l o c k ( t h r eRaudnIWdoxr.kxE)l,e mgernotu ( )s.treupnS(iwzee)(;n c c| l ^S hmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppm:m6.:b1u:f fnote: Sin instantiation of member function 'RunWork, 2, 2>::run' requested herei zes[N C6C | LI_MPPRLO_TCOO_LSLI_MFPULNEC](/ANlClCRLe_dSuTcEeP,S /CsOiLzLeNoEfT(_TD)I)R E{C T ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S I M| P group(groupL E, PreMulSum, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:n641t:3112:_ tnote: )in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 641/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' prim s391( | t i dR-utniWdoSrtkaddoopwi,r eNcCtC-L>_oAuLtG,O _a#r#gasl-g>os,e nNdCbCuLf_fP,R OaTrOg_s#-#>prreoctvob>u(f)f.,r u n| ( ^& ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:.202w:o53r:k )note: ;in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here \ | ^202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15R:u nnote: Wfield 'nthreads' will be initialized after field 'tidInBlock'o rkEle m562e | n t < F nt,i dT(,t iRde)d,O pn,t hArlegaod,s (Pnrtohtroe>a(d)s.)r,u nt(iwdeI)n;B l o| c ^k (threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:I7d:x1.:x )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here grou p7( | gIrMoPuLp_)C,O L L| _ ^~~~~~~~~~~~~~~~~F UNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:l562l:R60e:d unote: cfield 'group' will be initialized after field 'stepSize'e , COL L562N | E T _ D ItRiEdC(Tt,i dS)I,M PnLtEh,r ePardesM(unltShurme,a dusi)n,t 3t2i_dtI)n B l| o^c k(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95I:d xnote: .expanded from macro 'IMPL_COLL_FUNC'x ), gro u391p | ( g rRouunpW)o,r k <| n ^~~~~~~~~~~c clFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O15T:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# proto>(). r562u | n ( & n ctcildS(htmiedm).,w onrtkh)r;e a\d s (| n ^t hreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~~~~~~~[ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O60T:O _note: Sfield 'group' will be initialized after field 'stepSize'I MPLE] /562N | C C L _ StTiEdP(St/isdi)z,e onft(hTr)e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h666r:e9a:d Inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex .x), g666r | o u p ( g r o u pp)r,i m s| ( ^~~~~~~~~~~t id, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor]563 | step S562i | z e ( n ctcildS(htmiedm).,c onmtmh.rbeuafdfsS(inztehsr[eNaCdCsL)_,P RtOiTdOI_nSBIlMoPcLkE(]t/hNrCeCaLd_ISdTxE.PxS)/,s igzreooufp((Tg)r)o u{p ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| group(group | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666: 9563: | note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here step S666i | z e ( n c c l S hpmreimm.sc(otmimd.,b unfTfhSriezaedss[GNaCtChLe_rP,R OdTiOr_eScItM-P>LuEp],/ NNCUCLLL_,S TaErPgSs/-s>iszeenodfb(uTf)f), {a r g| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- > r| e group(groupc vbuff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^: 687:11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here687 | 202 | p r i mRsu(ntWiodr-ktEildeSmteanrtte(c)t.-r>uonu(tw,e )n;u l l| p ^t r, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpps:-6>:s1e:n dnote: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ff, a6r | gIsM-P>Lr_eCcOvLbLu_fFfU,N C (| A ^l lReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:,202 :C53O:L Lnote: Nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE T_DI R202E | C T , S I M P LREu,n WPorrekMEulleSmuemn,t ().ru n391( | w e )R;u n W| o ^r k, 2, 2>::run' requested heren c, t y5p | eI,M PFLu_nCcO#L#Ld_eFvUrNeCd(oAplc,e ,N CCCOLL_LANLEGTO__D#I#RaElCgTo,, SNICMCPLL_EP,R OPTrOe_M#u#lpSruomt,o >u(i)n.tr8u_nt()& n c| c^l Shm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:m391.:w95o:r knote: )expanded from macro 'IMPL_COLL_FUNC'; \ | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:u562n:W15o:r knote: ,t iNdCICnLB_lAoLcGkO(_t#h#raelagdoI,d xN.CxC)L,_ PgRrOoTuOp_(#g#rporuopt)o,> ( )| . ^~~~~~~~~~~~~~~~~r un/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:&562n:c60c:l Snote: hfield 'group' will be initialized after field 'stepSize'm em.wo r562k | ) ; \ t i| d ^( tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~e adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :R562u:n15W:o rwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]E lementh(r)e.ardusn((nwteh)r;e a d| s ^) , tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppn:B7l:o1c:k (note: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read I7d | xI.MxP)L,_ CgOrLoLu_pF(UgNrCo(uApl)l,R e d| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c e ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C OLLNE T563_ | D I R E CsTt,e pSSIiMzPeL(En,c cPlrSehMmuelmS.ucmo,m mu.ibnutf3f2S_itz)e s [| N^C CL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:O391T:O95_:S Inote: Mexpanded from macro 'IMPL_COLL_FUNC'P LE]/NC C391L | _ S TREuPnSW/osrikz, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here> , NCCL _687A | L G O _ # # a l g o ,p rNiCmCsL(_tPiRdO-TtOi_d#S#tparrottBoc>a(s)t.,r unnT(h&rnecacdlsSBhcmaesmt.,w o&rdki)r;e c\t - >| o ^u t, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:l562l:p15t:r ,note: field 'nthreads' will be initialized after field 'tidInBlock'a rgs-> s562e | n d b u ftfi,d (atrigds)-,> rnetchvrbeuafdfs,( n t| h ^r eads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:I53n:B lnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec k(th r202e | a d I d x . x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~~~~~~~ T,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :R562e:d60O:p ,note: field 'group' will be initialized after field 'stepSize'A lgo, P562r | o t o > (t)i.dr(utni(dw)e,) ;n t h| r ^e ads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppt:h5r:e1a:d snote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, tid I5n | BIlMoPcLk_(CtOhLrLe_aFdUINdCx(.Axl)l,R egdruocuep,( gCrOoLuLpN)E,T _ D| I ^~~~~~~~~~~R ECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) Reduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:i562n:t158:_ twarning: )initializer order does not match the declaration order [-Wreorder-ctor] | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'( tid), n391t | h r eRaudnsW(onrtkhu,p )N,C C L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~A L G| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##alg o563, | N C C Ls_tPeRpOSTizeO(_n#c#cplrSohtmoe>m(.)c.ormumn.(b&unfcfcSliSzhemse[mN.CwCoLr_kP)R;O T\O _ S| I ^M PLE]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15S:T Enote: Pfield 'nthreads' will be initialized after field 'tidInBlock'S /size o562f | ( T ) ) t{i d (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i d )| , group(group nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:n677t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres ), ti d677I | n B l o c k ( t h r epardiImdsx(.txi)d,- tgirdoSutpa(rgtrBocuaps)t,, n| T ^~~~~~~~~~~~~~~~~h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:B562c:a60s:t ,note: field 'group' will be initialized after field 'stepSize'& direc t562- | > o u t ,t iddi(rteicdt)-,> dnotwhnr,e aadrsg(sn-t>hsreenaddbsu)f,f ,t iadrIgnsB-l>orcekc(vtbhurfefa,d I d| x ^. x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202(:g53r:o unote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , | ^~~~~~~~~~~202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp::6562::115:: note: warning: in instantiation of member function 'RunWork, 2, 2>::run' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 6 | IMPL _562C | O L L _ FtUiNdC((tAildl)R,e dnutcher,e aCdOsL(LnNtEhTr_eDaIdRsE)C,T ,t iSdIIMnPBLlEo,c kP(rtehMruelaSduImd,x .ixn)t,3 2g_rto)u p (| g^r oup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: note: | expanded from macro 'IMPL_COLL_FUNC' tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391563 | | R u nsWtoerpkSM,P LNEC]C/LN_CACLLG_OS_T#E#PaSl/gsoi,z eNoCfC(LT_)P)R O{T O _| # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~# p r| o group(groupt o>().run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:c666c:l9S:h mnote: emin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. work); 666\ | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :p562r:i15m:s (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d, n T562h | r e a d stGiadt(hteird,) ,d inrtehcrte-a>dusp(,n tNhUrLeLa,d sa)r,g st-i>dsIennBdlboucfkf(,t harregasd-I>drxe.cxv)b,u fgfr,o u p| ( ^g roup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~~~~~~~: 202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h53::562 :note: 60in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: note: field 'group' will be initialized after field 'stepSize' 202 | 562 | t iRdu(ntWiodr)k,E lnetmhernetah(r)e.arduInd(xw.ex));, g| r ^o up(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppu:p7):,1 : | note: ^~~~~~~~~~~in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: warning: initializer order does not match the declaration order [-Wreorder-ctor]initializer order does not match the declaration order [-Wreorder-ctor] 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563563 | | sstteeppSSiizzee((nnccccllSShhmmeemm..ccoommmm..bbuuffffSSiizzeess[[NNCCCCLL__PPRROOTTOO__SSIIMMPPLLEE]]//NNCCCCLL__SSTTEEPPSS//ssiizzeeooff((TT)))) {{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | group(group group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::687655::1111:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687655 | | pprriimmss((ttiidd--ttiiddSSttaarrttBRceadsutc,e ,n TnhTrheraedasdBscRaesdtu,c e&,d inruelcltp-t>ro,u t&,d inruelcltp-t>ro,u ta,r gasr-g>ss-e>nsdebnudfbfu,f fa,r gasr-g>sr-e>crvebcuvfbfu,f f ,| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202::20253::53 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R uRnuWnoWrokrEklEelmeemnetno(>)(.)r.urnu(nw(ew)e;) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp::57::11:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 57 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, PPrreeMMuullSSuumm,, uuiinntt83_2t_)t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<>,, NNCCCCLL__AALLGGOO__####aallggoo,, NNCCCCLL__PPRROOTTOO__####pprroottoo>>(())..rruunn((&&nnccccllSShhmmeemm..wwoorrkk));; \\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::15 :warning: initializer order does not match the declaration order [-Wreorder-ctor]warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t e psStiezpeS(inzcec(lnSchcmleSmh.mceomm.mc.obmumf.fbSuifzfeSsi[zNeCsC[LN_CPCRLO_TPOR_OSTIOM_PSLIEM]P/LNEC]C/LN_CSCTLE_PSST/EsPiSz/esoifz(eTo)f)( T{) ) | { ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 677note: :in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | 677 | p r i m s ( t i d ,p rniTmhsr(etaidds-GtaitdhSetra,r tdBicraesctt,- >nuTph,r eNaUdLsLB,c aasrtg,s -&>dsiernedcbtu-f>fo,u ta,r gdsi-r>ercetc-v>bduofwfn,, a| r ^g s->sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:b202u:f53f:, note: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer gs- >202r | e c v b u f f , R u| n ^W orkElem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:n202t:<53F:n ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT , Red O202p | , A l g o , PRruontWoo>r(k)E.lreumne(nwte<)F;n , | T ^, RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppp:,6 :A1l:g onote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here Prot o6> | (I)M.PrLu_nC(OwLeL)_;F U N| C ^( AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppu:c7e:,1 :C Onote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested hereL NET_D I7R | EICMTP,L _SCIOMLPLL_EF,U NPCr(eAMlullRSeudmu,c ei,n tC3O2L_LtN)E T _| D^I REC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:,391 :S95I:M Pnote: Lexpanded from macro 'IMPL_COLL_FUNC'E , Pre M391u | l S uRmu,n Wuoirnkt<3n2c_ctl)F u n| c^# #fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:,391 :t95y:p enote: ,expanded from macro 'IMPL_COLL_FUNC' Func##d e391v | r e dRoupnn,c cNlCFCuLn_cA#L#GfOu_n#c#,a ltgyop,e ,N CFCuLn_cP#R#OdTeOv_r#e#dporpop(e)>.,r uNnC(C&Ln_cAcLlGSOh_m#e#ma.lwgoor,k )N;C C\L _ P| R ^O TO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:p562r:o15t:o >note: (field 'nthreads' will be initialized after field 'tidInBlock') .run (562& | n c c l Sthimde(mt.iwdo)r,k )n;t h\r e a| d ^s (nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, note: tfield 'nthreads' will be initialized after field 'tidInBlock'i dInBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I60n:B lnote: ofield 'group' will be initialized after field 'stepSize'c k(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(Stihzreesa[dNICdCxL._xP)R,O TgOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~T EPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 | 563t | i d ( t isdt)e,p Snitzher(enacdcsl(Snhtmherme.acdosm)m,. btuifdfISniBzleosc[kN(CtChLr_ePaRdOITdOx_.SxI)M,P LgEr]o/uNpC(CgLr_oSuTpE)P,S / s| i ^~~~~~~~~~~z eof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, &dire:c562t:-15>:o uwarning: tinitializer order does not match the declaration order [-Wreorder-ctor], args->send b562u | f f , atrigds(-t>irde)c,v bnutfhfr,e a d| s ^( nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Bloc k202( | t h r e a d I d xR.uxn)W,o rgkrEoluepm(egnrtoe(p)S.irzuen((nwcec)l;S h m| e ^m .comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppb:u6f:f1S:i znote: ein instantiation of member function 'RunWork, 2, 2>::run' requested heres [NCC L6_ | PIRMOPTLO__CSOILMLP_LFEU]N/CN(CAClLl_RSeTdEuPcSe/,s iCzOeLoLfN(ETT)_)D I{R E C| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, S| I group(groupM PLE, PreMulS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:m641,: 11i:n tnote: 3in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here2 _t) | 641^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : pnote: rexpanded from macro 'IMPL_COLL_FUNC'i ms(ti d391- | t i dRSutnaWrotrRkeedvorwend,o p&t,- >NoCuCtL,_ AaLrGgOs_-#>#saelngdob,u fNfC,C La_rPgRsO-T>Or_e#c#vpbruoftfo,> ( )| . ^r un(&ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:S202h:m53e:m .note: win instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo rk); 202\ | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:u562n:W15o:r knote: Efield 'nthreads' will be initialized after field 'tidInBlock'l ement <562F | n , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(on>t(h)r.eraudns()w,e )t;i d I| n ^B lock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppr:e7a:d1I:d xnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herex ), g r7o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~~~~~~~C (Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:R562e:d60u:c enote: ,field 'group' will be initialized after field 'stepSize' COLL N562E | T _ D I RtEiCdT(,t iSdI)M,P LnEt,h rPeraedsM(unltShurme,a dusi)n,t 3t2i_dtI)n B l| o^c k(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:I95d:x .note: xexpanded from macro 'IMPL_COLL_FUNC') , grou p391( | g r oRuupn)W,o r k| < ^~~~~~~~~~~n cclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho>().run:(562&:n15c:c lwarning: Shinitializer order does not match the declaration order [-Wreorder-ctor]m em.work); \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t inote: dfield 'nthreads' will be initialized after field 'tidInBlock') , nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup(g r563o | u p ) , s t| e ^~~~~~~~~~~~~~~~~p Siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:(562n:c60c:l Snote: hfield 'group' will be initialized after field 'stepSize'm em.c o562m | m . b u ftfiSdi(zteisd[)N,C CnLt_hPrReOaTdOs_(SnItMhPrLeEa]d/sN)C,C Lt_iSdTIEnPBSl/oscikz(etohfr(eTa)d)I d{x . x| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, g| r group(groupo up(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:d562b:u15f:f ,warning: initializer order does not match the declaration order [-Wreorder-ctor]a rgs->re c562v | b u f f ,t i d| ( ^t id), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:( nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh read s202) | , t i d I n B lRoucnkW(otrhkrEelaedmIednxt. ( ) 563.t | ri udn (( wt eis)dt;)e ,p S| n ^it zher(enacdcs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppl:(S7n:htm1he:r menote: .ain instantiation of member function 'RunWork, 2, 2>::run' requested herecd osm)m, .7 b | tIuiMfdPfILSn_iBCzlOeoLscLk[_(NFtCUhCNrCL(e_AaPldRlIORdTexOd._uxcS)eI,,M PCgLOrELo]uL/NpNE(CTgC_rLDo_IuSRpTE)EC,PT S, / | sS ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i Iz Me| Po tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)Lf E(,T ) P)563r | {e M u | l S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~su t me| ,p group(groupSu iiznet(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h3n:2c666_ct:l)9S: h m note: e| in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem^ . co m666m | . /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hb :u 391f :f 95S :i znote: eexpanded from macro 'IMPL_COLL_FUNC'ps r[iNmCsC(L t_391iP | dR ,O T ROnu_TnShWIorMrePkaLetuoypfp,(e T,N) )UF LuL{,n c #a| #r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d g e| sv group(group-r >esdeonpd,:, 11 :aN rCnote: Cgin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereLs _-A>L rG655eO | c_ v# b# ua fl fg ,o , | N ^Cp CrLi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_m:Ps202R(:Ot53TiO:d -_note: t#in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei# dpSr ott202ao | r>t (R) e. dr uu cn e (, & nRncuTcnhlWrSoehrakmdeEsmlR.eewmdoeurnctke<),; F nn\u, l l| Tp ^,t rR,e d&Odp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi,:r 562e:Ac15ltg:-o >note: ,ofield 'nthreads' will be initialized after field 'tidInBlock' uP tr,o tao r>562g( | s )- .>r su entn(iddwb(eut)fi;fd ,) ,| a ^rn gtsh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp-r:>e8r:ae1cd:sv (bnote: nuin instantiation of member function 'RunWork, 2, 2>::run' requested heretf hfr, e a8d | s| I) ^,M PtLi_dCIOn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hLB:lL202o_:cF53k:U( Nnote: tCin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh( rAel al202dRI | ed dx u. cx e ), , C gORrLuLonNuWEpoTr_(kDgEIrlReoEmuCepT)n,,t < S F| In ^~~~~~~~~~~~~~~~~M, P LTE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, : R562P:er60deO:Mp unote: ,field 'group' will be initialized after field 'stepSize'l ASlugmo, , 562 i | Pn rt o6 t4 o_tt>i()d)( .tr iud| n)(^,w en)t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;h :r 391| e: ^a95d s:( nnote: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppexpanded from macro 'IMPL_COLL_FUNC'h: r7e:a1d :s391 | )note: , in instantiation of member function 'RunWork, 2, 2>::run' requested here tRiud nI7nW | BoIlrMokPcC,C LN_CPCRLO_TAOL_GSOI_M#P#LaEl]g/oN,C CNLC_CSLT_EPPRSO/TsOi_z#e#opfr(oTt)o)> ({) . r| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n ( &| n group(groupc clShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hw:o687r:k11):; note: \in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 687 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock'p rims( t562i | d - t i dtSitda(rttiBdc)a,s tn,t hnrTehardesa(dnstBhcraesatd,s )&,d itriedcItn-B>loouctk,( tnhurlelapdtIrd,x .axr)g,s -g>rsoeunpd(bgurfofu,p )a,r g s| - ^~~~~~~~~~~~~~~~~> re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:v562b:u60f:f ,note: field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53(:t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested here) , nt h202r | e a d s ( n t h rReuandWso)r,k EtliedmIennBtlo(u)p.)r,u n (| w ^~~~~~~~~~~e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o15u:p )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 60 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: field 'group' will be initialized after field 'stepSize' 563 | 562 | s tteipdS(itzied()n,c cnltShhrmeeamd.sc(onmtmh.rbeuafdfsS)i,z etsi[dNICnCBLl_oPcRkO(TtOh_rSeIaMdPILdEx]./xN)C,C Lg_rSoTuEpP(Sg/rsoiuzpe)o,f ( T| ) ^~~~~~~~~~~) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i15d:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]n threads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I d x| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x ), gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n c| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Shmem .563c | o m m . bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| S group(groupT EPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:o626f:(9T:) )note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 626| | group(group pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:m641s:(11t:i dnote: -in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret idStar t641S | c a t t e r , n T hprreiamdss(Stciadt-tteird,S tNaUrLtLR,e dduicree,c tn-T>hurpe,a dasrRgesd-u>csee,n ddbiurfefc,t -a>rdgosw-n>,r e&cdvibruefcft,- > o| u ^t , arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:-202>:s53e:n dnote: bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu ff, a202r | g s - > r e c v bRuufnfW,o r k| E ^l ement, 2, 2>::run' requested hered Op, A202l | g o , P r o t oR>u(n)W.orruknE(lweem)e;n t <| F ^n , T, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppO:p7,: 1A:l gnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested here, Prot o7> | ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:(15n:c cwarning: linitializer order does not match the declaration order [-Wreorder-ctor]S hmem.com m562. | b u f f Stiizde(st[iNdC)C,L _nPtRhOrTeOa_dSsI(MnPtLhEr]e/aNdCsC)L,_ StTiEdPISn/Bsliozceko(ft(hTr)e)a d{I d x| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x ) ,| group(groupg roup(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p655):,11 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 655 | 563 | s t e ppSriizmes((ntcicdl-SthimdeSmt.acrotmRme.dbuucfef,S inzTehsr[eNaCdCsLR_ePdRuOcTeO,_ SnIuMlPlLpEt]r/,N C&CdLi_rSeTcEtP-S>/osuitz,e oafr(gTs)-)> s{e n d| b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u f f| , group(group args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:c641v:b11u:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)MPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562u:m15,: iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t 64_t) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95t:i dnote: (expanded from macro 'IMPL_COLL_FUNC't id), n391t | h r eRaudnsW(onrtkhu,p )N,C C L| _ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: Pnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo to>() .8r | uInM(PwLe_)C;O L L| _ ^F UNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppA:l9l:R1e:d unote: cin instantiation of member function 'RunWork, 2, 2>::run' requested heree , COL L9N | EITM_PDLI_RCEOCLTL,_ FSUINMCP(LAEl,l RPerdeuMcuel,S uCmO,L LiNnEtT6_4D_ItR)E C T| ,^ SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391P:r95e:M unote: lexpanded from macro 'IMPL_COLL_FUNC'S um, uin t3916 | 4 _ tR)u n W| o^r ku,n cN,C CtLy_pAeL,G OF_u#n#ca#l#gdoe,v rNeCdCoLp__,# #NpCrCoLt_oA>L(G)O._r#u#na(l&gnoc,c lNSChCmLe_mP.RwOoTrOk_)#;# p\r o t| o ^> ().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:(562&:n15c:c lnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'h mem.w o562r | k ) ; \t i d| ( ^t id), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:I dnote: xfield 'group' will be initialized after field 'stepSize'. x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduIpn(Bglroocukp()t,h r e| a ^~~~~~~~~~~~~~~~~d Idx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,60 :g rnote: ofield 'group' will be initialized after field 'stepSize'u p(gro u562p | ) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i d), n t563h | r e a d ss(tnetphSriezaed(sn)c,c ltSihdmIenmB.lcoocmkm(.tbhurfefaSdiIzdexs.[xN)C,C Lg_rPoRuOpT(Og_rSoIuMpP)L,E ] /| N ^~~~~~~~~~~C CL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:: 8warning: :initializer order does not match the declaration order [-Wreorder-ctor]1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 8 | I M PtLi_dC(OtLiLd_)F,U NnCt(hArlelaRdesd(uncteh,r eCaOdLsL)N,E Tt_iDdIIRnEBClTo,c kS(ItMhPrLeEa,d IPdrxe.Mxu)l,S ugmr,o uipn(tg6r4o_utp)) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95 :563 | note: expanded from macro 'IMPL_COLL_FUNC' stepS i391z | e ( nRcucnlWSohrmkeT,E PNSC/CsLi_zAeLoGfO(_T#)#)a l{g o ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N C C| L group(group_ PROTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:r641o:t11o:> (note: )in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. run(& n641c | c l S h m e m . w o rpkr)i;m s\( t i| d ^- tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562t:R15e:d unote: cfield 'nthreads' will be initialized after field 'tidInBlock'e , nThr e562a | d s R e dtuicde(,t iddi)r,e cntt-h>rdeoawdns,( n&tdhirreeacdts-)>,o utti,d IanrBglso-c>ks(etnhdrbeuafdfI,d xa.rxg)s,- >grreocuvpb(ugfrfo,u p )| , ^ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202::56253::60 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herenote: field 'group' will be initialized after field 'stepSize' 202 | 562 | tRiudn(Wtoirdk)E,l enmtehnrtet(h)r.eraudnI(dwxe.)x;) , | g ^r oup(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppg:r10o:u1p:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~ 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 53562: | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid (202t | i d ) , n t h rReuandWso(rnktEhlreemaednst)<,F nt,i dTI,n BRleodcOkp(,t hArlegaod,I dPxr.oxt)o,> (g)r.oruupn((gwreo)u;p ) ,| ^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :5639 | : 1 : note: sin instantiation of member function 'RunWork, 2, 2>::run' requested heret epSi z9e | (InMcPcLl_SChOmLeLm_.FcUoNmCm(.AblulfRfeSdiuzcees,[ NCCOCLLL_NPERTO_TDOI_RSEICMTP,L ES]I/MNPCLCEL,_ SPTrEePMSu/lsSiuzme,o fu(iTn)t)6 4{_ t )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 677expanded from macro 'IMPL_COLL_FUNC': 11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | R677u | n W o r k < n c c l Fpurnicm#s#(ftuindc-,t itdySptea,r tFBucnacs#t#,d envTrherdeoapdt,, N&CdCiLr_eAcLtG-O>_o#u#ta,l gdoi,r eNcCtC-L>_dPoRwOnT,O _a#r#gpsr-o>tsoe>n(d)b.urfufn,( &anrcgcsl-S>hrmeecmv.bwuofrfk,) ; | \ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h53:: 562note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. x )| , ^ group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppg:r10o:u1p:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ 10 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_60C:O Lnote: Lfield 'group' will be initialized after field 'stepSize'_ FUNC( A562l | l R e d utcied,( tCiOdL)L,N EnTt_hDrIeRaEdCsT(,n tShIrMePaLdEs,) ,P rteiMduIlnSBulmo,c kh(atlhfr)e a d| I^d x.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC'( group )391, | | R ^~~~~~~~~~~u nWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO_##p:r562o:t15o:> (warning: )initializer order does not match the declaration order [-Wreorder-ctor]. run(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd s), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thr e563a | d I d x .sxt)e,p Sgirzoeu(pn(cgcrloSuhpm)e,m . c| o ^~~~~~~~~~~~~~~~~m m.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:S60i:z enote: sfield 'group' will be initialized after field 'stepSize'[ NCCL_ P562R | O T O _ StIiMdP(LtEi]d/)N,C CnLt_hSrTeEaPdSs/(snitzheroefa(dTs))), {t i d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n B l| o group(groupc k(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)687,: 11g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (gro u687p | ) , | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:15: note: 563field 'nthreads' will be initialized after field 'tidInBlock' | st e562p | S i z e (tnicdc(ltSihdm)e,m .nctohmrme.abdusf(fnStihzreesa[dNsC)C,L _tPiRdOITnOB_lSoIcMkP(LtEh]r/eNaCdCILd_xS.TxE)P,S /gsriozuepo(fg(rTo)u)p ){, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :field 'group' will be initialized after field 'stepSize'677 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 | t677i | d ( t i d ) , n t hprreiamdss((tnitdh-rteiaddSst)a,r ttBicdaIsntB,l oncTkh(rtehardesaBdcIadsxt.,x )&,d igrreocutp-(>goruotu,p )d,i r e| c ^~~~~~~~~~~t ->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 :p rwarning: iinitializer order does not match the declaration order [-Wreorder-ctor]m s(tid-t i562d | S t a r ttSicda(tttiedr),, nnTthhrreeaaddssS(cnatthtreera,d sN)U,L Lt,i ddIinrBelcotc-k>(utph,r eaardgIsd-x>.sxe)n,d bgurfofu,p (agrrgosu-p>)r,e c v| b ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u f f| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i202z:e53(:n cnote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel Shmem. c202o | m m . b u f f S iRzuensW[oNrCkCELl_ePmReOnTtO<_FSnI,M PTL,E ]R/eNdCOCpL,_ SATlEgPoS,/ sPirzoetoof>((T)).)r u{n ( w| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::9666::19:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 9666 | | I M P L _ C O L Lp_rFiUmNsC((tAildl,R endTuhcree,a dCsOGLaLtNhEeTr_,D IdRiErCeTc,t -S>IuMpP,L EN,U LPLr,e MaurlgSsu-m>,s eunidnbtu6f4f_,t )a r g| s^- >rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hv:b391u:f95f:, note: expanded from macro 'IMPL_COLL_FUNC'| ^ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 202 :R53u:n Wnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer kA,l gNoC,C LP_rAoLtGoO>_(#)#.arlugno(,w eN)C;C L _| P ^R OTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp#:p10r:o1t:o >note: (in instantiation of member function 'RunWork, 2, 2>::run' requested here) .ru n10( | &InMcPcLl_SChOmLeLm_.FwUoNrCk()A;l l\R e d| u ^c e, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:E562T:_15D:I Rnote: Efield 'nthreads' will be initialized after field 'tidInBlock'C T, SIM P562L | E , P rteiMdu(ltSiudm),, hnatlhfr)e a d| s^( nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95s:) ,note: expanded from macro 'IMPL_COLL_FUNC't idInB l391o | c k (RtuhnrWeoardkI, 562N | C C L _ AtLiGdO(_t#i#da)l,g on,t hNrCeCaLd_sP(RnOtThOr_e#a#dpsr)o,t ot>i(d)I.nrBulno(c&kn(ctchlrSehamdeImd.xw.oxr)k,) ;g r\o u p| ( ^g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 : 15| : ^~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hstepS:i562z:e15(:n cwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]l Shmem.comm. b562u | f f S i zteisd[(NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt]h/rNeCaCdLs_)S,T EtPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u641p:)11,: note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 641 | 563 | s t e ppSriizmes((ntcicdl-SthimdeSmt.acrotmRme.dbuucfef,S inzTehsr[eNaCdCsLR_ePdRuOcTeO,_ SdIiMrPeLcEt]-/>NdCoCwLn_,S T&EdPiSr/escitz-e>oofu(tT,) )a r{g s -| > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s e n| d group(groupb uff, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:c626v:b9u:f fnote: ,in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 :p rnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem s(ti d202- | t i d S t a r t SRcuantWtoerrk,E lneTmhernetaou>p(,) .arrugns(-w>es)e;n d b| u ^f f, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpps:-9>:r1e:c vnote: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ff, 9| | ^I MPL_COLL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hF:U202N:C53(:A lnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR educ e202, | C O L L N E T _RDuInRWEoCrTk,E lSeImMePnLtE<,F nP,r eTM,u lRSeudmO,p ,u iAnltg6o4,_ tP)r o t| o^> ().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n391(:w95e:) ;note: expanded from macro 'IMPL_COLL_FUNC' | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp : 10R:u1n:W onote: rin instantiation of member function 'RunWork, 2, 2>::run' requested herek R,E CNTC,C LS_IAMLPGLOE_,# #ParlegMou,l SNuCmC,L _hPaRlOfT)O _ #| #^p roto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:(391):.95r:u nnote: (expanded from macro 'IMPL_COLL_FUNC'& ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduc e562, | C O L LtNiEdT(_tDiIdR)E,C Tn,t hSrIeMaPdLsE(,n tPhrreeMaudlsS)u,m ,t ihdaIlnfB)l o c| k^( thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391I:d95x:. xnote: )expanded from macro 'IMPL_COLL_FUNC', group (391g | r o uRpu)n,W o r| k ^~~~~~~~~~~~~~~~~< ncclFun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:#562#:f60u:n cnote: ,field 'group' will be initialized after field 'stepSize' type, 562F | u n c # #tdiedv(rteiddo)p,< tnytpher>e,a dNsC(CnLt_hArLeGaOd_s#)#,a ltgiod,I nNCCL_BPlRoOcTkO(_t#h#rperaodtIod>x(.)x.)r,u ng(r&onucpc(lgSrhomuemp.)w,o r k| ) ^~~~~~~~~~~; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 : 15| : ^ warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :56253 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here tid( t202i | d ) , n t h r eRaudnsW(onrtkhErleeamdesn)t,< Ftni,d ITn,B lRoecdkO(pt,h rAelagdoI,d xP.rxo)t,o >g(r)o.urpu(ng(rwoeu)p;) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :9:1: 563note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here s t9e | pISMiPzLe_(CnOcLcLl_SFhUmNeCm(.AclolmRme.dbuucfef,S iCzOeLsL[NNECTC_LD_IPRREOCTTO,_ SSIIMMPPLLEE],/ NPCrCeLM_uSlTSEuPmS,/ suiizneto6f4(_Tt))) {| ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : group(group391 :95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11 :391 | note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWor k677< | n c c l F u n c # # fpurnicm,s (ttyipde-,t iFduSntca#r#tdBecvarsetd,o pnout, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I60n:B lnote: ofield 'group' will be initialized after field 'stepSize'c k(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s), t i563d | I n B l osctke(ptShirzeea(dnIcdcxl.Sxh)m,e mg.rcooumpm(.gbruofufpS)i,z e s| [ ^~~~~~~~~~~N CCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c k (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h readI d563x | . x ) , sgtreopuSpi(zger(onucpc)l,S h m| e ^~~~~~~~~~~m .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hCL_PROT:O562_:#15#:p rwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]t o>().run(&ncclShm e562m | . w o r kt)i;d (\t i d| ) ^, nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~~~~~~~o mm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f60S:i znote: efield 'group' will be initialized after field 'stepSize's [NCCL _562P | R O T O _tSiIdM(PtLiEd])/,N CnCtLh_rSeTaEdPsS(/nstihzreeoafd(sT)),) t{i d I| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~B l o| c group(groupk (threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:g641r:o11u:p )note: ,in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^~~~~~~~~~~ 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:15: note: field 'nthreads' will be initialized after field 'tidInBlock': 562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ti d563( | t i d ) ,s tnetphSriezaed(sn(cnctlhSrhemaedms.)c,o mtmi.dbIunfBflSoiczke(st[hNrCeCaLd_IPdRxO.TxO)_,S IgMrPoLuEp](/gNrCoCuLp_)S,T E P| S ^~~~~~~~~~~/ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement:(562):.15r:u nwarning: (initializer order does not match the declaration order [-Wreorder-ctor]w e); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppt:i11d:(1t:i dnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, nth r11e | aIdMsP(Ln_tChOrLeLa_dFsU)N,C (tAildlIRneBdluoccek,( tChOrLeLaNdEITd_xD.IxR)E,C Tg,r oSuIpM(PgLrEo,u pP)r,e M u| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S u m| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) float )563 | | ^ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i391z:e95(:n cnote: cexpanded from macro 'IMPL_COLL_FUNC'l Shmem. c391o | m m .RbuunfWfoSrikz), {N C C| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ A L| G group(groupO _##algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:C666L:_9P:R Onote: Tin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO _##pr o666t | o > ( ) . r u np(r&inmcsc(ltSihdm,e mn.Twhorreka)d;s G\a t h| e ^r , dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562t:-15>:u pnote: ,field 'nthreads' will be initialized after field 'tidInBlock' NULL, 562a | r g s - >tsiedn(dtbiudf)f,, natrhgrse-a>drse(cnvtbhurfefa,d s )| , ^ tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k202(:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered Idx.x )202, | g r o u p ( g rRouunpW)o,r k E| l ^~~~~~~~~~~~~~~~~e men/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:<562F:n60,: Tnote: ,field 'group' will be initialized after field 'stepSize' RedOp ,562 | A l g o ,t iPdr(ottiod>)(,) .nrtuhnr(ewaed)s;( n t| h ^r eads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppt:i11d:I1n:B lnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested herec k(th r11e | aIdMIPdLx_.CxO)L,L _gFrUoNuCp((AglrloRuepd)u,c e ,| ^~~~~~~~~~~C OLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppN:E13T:_1D:I Rnote: Ein instantiation of member function 'RunWork, 2, 2>::run' requested hereC T, SIMP L13E | ,I MPPrLe_MCuOlLSLu_mF,U NdCo(uAbllleR)e d u| c^e , CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:L391N:E95T:_ Dnote: Iexpanded from macro 'IMPL_COLL_FUNC'R ECT, S I391M | P L ER,u nPWroerMku , NCCL_ A391L | G O _R#u#naWlogrok,< nNcCcClLF_uPnRcO#T#Of_u#n#cp,r ottyop>e(,) .Fruunnc(#&#ndcecvlrSehdmoepm<.twyoprek>),; N\C C L| _ ^A LGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:a562l:g15o:, note: Nfield 'nthreads' will be initialized after field 'tidInBlock'C CL_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B lock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' group( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~~~~~~~t id)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s(nt h562r | e a d s )t,i dt(itdiIdn)B,l onctkh(rtehardesa(dnItdhxr.exa)d,s )g,r otuipd(IgnrBoluopc)k,( t h| r ^~~~~~~~~~~~~~~~~e adId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)60,: gnote: rfield 'group' will be initialized after field 'stepSize'o up(gr o562u | p ) , t| i ^~~~~~~~~~~d (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTOprims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_# #algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u15n:W owarning: rinitializer order does not match the declaration order [-Wreorder-ctor]k Element((n)t.hrruena(dwse)),; t i| d ^I nBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp(:t12h:r1e:a dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hered x.x) ,12 | gIrMoPuLp_(CgOrLoLu_pF)U,N C (| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l l R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d uce, C563O | L L N E Ts_tDeIpRSEiCzTe,( nScIcMlPSLhEm,e mP.rceoMmuml.Sbuumf,f Sdiozuebsl[eN)C C L| _^P ROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:I391M:P95L:E ]note: /expanded from macro 'IMPL_COLL_FUNC'N CCL_ST E391P | S / sRiuzneWoofr(kT<)n)c c{l F u| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c # #| f group(groupu nc, type,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :F641u:n11c:# #note: din instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree vredop <641t | y p e > , N C C L _pArLiGmOs_(#t#iadl-gtoi,d SNtCaCrLt_RPeRdOuTcOe_,# #npTrhorteoa>d(s)R.erduunc(e&,n cdcilrSehcmte-m>.dwoowrk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] tidInBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~d s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hreads):,562 :t15i:d Iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]B lock(threadIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n threads), t563i | d I n B lsotcekp(Stihzree(andcIcdlxS.hxm)e,m .gcroomump.(bgurfofuSpi)z,e s [| N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C C L| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)P ROTO _563S | I M P L Es]t/epNSCiCzLe_(SnTcEcPlSS/hsmiezme.ocfo(mTm).)b u{f f S| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~z e s| [ group(groupN CCL_PROTO_SIMPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C666C:L9_:S Tnote: Ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP S/siz e666o | f ( T ) ) { p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i m s| ( group(groupt id, nThreadsGat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:e655r:,11 :d inote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ct->up ,655 | N U L L , a r g s -p>rsiemnsd(btuifdf-,t iadrSgtsa-r>trReecdvubcuef,f ,n T h| r ^e adsRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:e202,: 53n:u lnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herep tr, &202d | i r e c t - > o uRtu,n WaorrgksE-l>esmeenndtbOrpe,c vAblugfof,, P r| o ^t o>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(202w:e53):; note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp : 13 : 1 : Rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren Work E13l | eImMePnLt_E(T)_.DrIuRnE(CwTe,) ;S I M| P ^L E, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppe:M11u:l1S:u mnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here rccl _11b | fIlMoPaLt_1C6O)L L _| F^U NC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:l391R:e95d:u cnote: eexpanded from macro 'IMPL_COLL_FUNC', COLLN E391T | _ D IRRuEnCWTo,r kS, NCC L391_ | A L GROu_n#W#oarlkg (F)u.nrcu#n#(d&envcrceldSohpmr,k )N;C C\L _ A| L ^G O_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:o562,: 15N:C Cnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ PROTO _562# | # p r o ttoi>d(()t.irdu)n,( &nntchcrleSahdmse(mn.twhorreka)d;s )\, t| i ^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~e ad/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s), t562i | d I n B ltoicdk((tthreadIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B60l:o cnote: kfield 'group' will be initialized after field 'stepSize'( thre a562d | I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~h reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp), :| 562 ^~~~~~~~~~~: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g: r 562owarning: u | initializer order does not match the declaration order [-Wreorder-ctor]p ) , t| i ^~~~~~~~~~~~~~~~~d (562t | i d /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h) :, 562 t:ni60td:h (rnote: tefield 'group' will be initialized after field 'stepSize'ia dd)s,( nnt th562hr | re eads), tid aI dn sBt(ilndot(cthkir(det)ah,dr sen)ta,hd rItediaxdd.Isxn()Bn,lt ohgcrrkeo(autdphs(rg)er,ao dutIpid)dx,I. nxB )l| ,o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ c gk r(| ot tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)uh pr(ega rd563oI | ud px ). ,x )s ,t| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~gp rS oi| uz tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)pe ((gnrc oc563ulp | S) h, m e m| s. ^~~~~~~~~~~tc eopmSmi.zbeu(fnfcScilzSehsm[eNmC.CcLo_mPmR.ObTuOf_fSSIiMzPeLsE[]N/CNCCLC_LP_RSOTTEOP_SS/IsMiPzLeEo]f/(NTC)C)L _{S T E| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S / s| i group(groupz eof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :{626 : 9| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| group(group 626 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 626 : 9 : note: pin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer ims(t i626d | - t i d S t a r tpSrciamtst(etri,d -ntTihdrSetaadrstSSccaatttteerr,, NnUTLhLr,e addisrSeccatt-t>eurp,, NaUrLgLs,- >dsiernedcbtu-f>fu,p ,a ragrsg-s>-r>escevnbdubfuff,f , | a ^r gs->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202c:v53b:u fnote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : Rnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Work E202l | e m e n t < F n ,R uTn,W oRrekdEOlpe,m eAnltge(d)O.pr,u nA(lwgeo),; P r| o ^t o>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppr:u12n:(1w:e )note: ;in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^ 12 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppP:L13_:C1O:L Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereF UNC( A13l | lIRMePdLu_cCeO,L LC_OFLULNNCE(TA_lDlIRReEdCuTc,e ,S ICMOPLLLEN,E TP_rDeIMRuElCSTu,m ,S IdMoPuLbEl,e )P r e| M^u lSu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:,391 :r95c:c lnote: _expanded from macro 'IMPL_COLL_FUNC'b float1 6391) | | R^u nWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k391<:n95c:c lnote: Fexpanded from macro 'IMPL_COLL_FUNC'u nc##fu n391c | , tRyupneW,o rFkuy,p eN,C CFLu_nAcL#G#Od_e#v#raeldgoop,< tNyCpCeL>_,P RNOCTCOL__#A#LpGrOo_t#o#>a(l)g.or,u nN(C&CnLc_cPlRSOhTmOe_m#.#wporrokt)o;> (\) . r| u ^n (&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:c562l:S15h:m enote: mfield 'nthreads' will be initialized after field 'tidInBlock'. work )562; | \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t ^ id(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t idInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I60d:x .note: xfield 'group' will be initialized after field 'stepSize') , group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~i d),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~h readIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:E562T:_15D:I Rwarning: Einitializer order does not match the declaration order [-Wreorder-ctor]C T, SIMPLE, 562P | r e M u ltSiudm(,t irdc)c,l _nbtfhlroeaatd1s6()n t h| r^e ads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391t:i95d:I nnote: Bexpanded from macro 'IMPL_COLL_FUNC'l ock(t h391r | e a dRIudnxW.oxr)k,< ngcrcoluFpu(ngcr#o#ufpu)n,c , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~y p e| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) Func##d e563v | r e d o psz,e (NnCcCcLl_SAhLmGeOm_.#c#oamlmg.ob,u fNfCSCiLz_ePROTsO[_N#C#CpLr_oPtRoO>T(O)_.SrIuMnP(L&En]c/cNlCSChLm_eSmT.EwPoSr/ks)i;z e\o f (| T ^) ) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 : 15| : group(group note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 666 : 9 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), 666n | t h r e a d s ( nptrhirmesa(dtsi)d,, tniTdhIrneBaldoscGka(tthherre,a ddIidrxe.cxt)-,> ugpr,o uNpU(LgLr,o uapr)g,s - >| s ^~~~~~~~~~~~~~~~~e ndb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:f562f:,60 :a rnote: gfield 'group' will be initialized after field 'stepSize's ->rec v562b | u f f , t i| d ^( tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren thre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] In file included from 153 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp : 1 : In file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hi:n10t: 3In file included from 2/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h_:t168 : d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ha:t153a:114,: fwarning: lunused variable 'data1' [-Wunused-variable]a g1, data2, f l153a | g 2 ; u| i ^~~~~n t32_t data1,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :f153l:a35g:1 ,warning: unused variable 'flag2' [-Wunused-variable]d ata2 ,153 | f l a g 2u;i n t| 3 ^~~~~2 _t data/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h1:,153 :f21l:a gwarning: 1unused variable 'flag1' [-Wunused-variable], da t153a | 2 , f luaign2t;3 2 _| t ^~~~~ data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), 563 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t epSi z563e | ( n c c lsSthempeSmi.zceo(mnmc.cbluSfhfmSeimz.ecso[mNmC.CbLu_fPfRSOiTzOe_sS[INMCPCLLE_]P/RNOCTCOL__SSITMEPPLSE/]s/iNzCeCoLf_(STT)E)P S{/ s i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e o f| ( group(groupT )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here87 :62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here149 | 87 | P r i m i tPirviemsi<,0 ,0 ,1 >P,r o0t,o ,P r1o>t op,r i1m>s p r| i ^m s | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h::9226:: 9note: :in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 212 | 226 | r u n RreucnvS1>(,t4i>d>,( tnitdh,r enatdhsr,e agdrso,u pg,r oaurpg,s )a;r g s| ) ^; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp :4:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cppnote: :in instantiation of member function 'RunWork, 1, 2>::run' requested here4 :1: note: 4in instantiation of member function 'RunWork, 1, 2>::run' requested here | IMPL _4C | OILMLP_LF_UCNOCL(LS_eFnUdNRCe(cSve,n dRRIeNcGv,, SRIIMNPGL,E ,S ISMuPmL,E ,i nStu8m_,t )i n t| 8^_ t) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h^: 391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :expanded from macro 'IMPL_COLL_FUNC'391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | Ru n391W | o r kRp,< tNyCpCeL>_,A LNGCOC_L#_#AaLlGgOo_,# #NaClCgLo_,P RNOCTCOL__#P#RpOrToOt_o#>#(p)r.ortuon>((&)n.crculnS(h&mnecmc.lwSohrmke)m;. w\o r k| ) ^; \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 60field 'group' will be initialized after field 'stepSize': note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~ | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr (tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dstIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ , *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp10::1 : warning: In file included from variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :13: In file included from 154/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h | : 169 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h :c271a:s19e: 3warning: :unused variable 'ptr' [-Wunused-variable] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp :2715 | : 9 : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here ui n5t | 6 4 _ t * p t rM S=C CrLe_cIvMPPtLr_(K0E)R+NlElL1_2E8NOTfRfYs_eFtU;N C _| D ^~~E VREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | voi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ d *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217 case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]i ce*warp +271 | 2 * w i d ; | u ^ int64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVRE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreterDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ , ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeofIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter ,warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]P rotoLL128, 154f | u l l O pcsa>s(ec o3m:m , | a ^l go, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, byteIn file included from s)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp;: 1 : | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ^~~: 154:10:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :warning: 162variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: 5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 154162 | | cdaesfea u3l:t : | ^| ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5 :1659 | : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here copyT o5S | h m e m 8 ( t i dM%SWCACRLP__ISMIPZLE_,K EdRsNtE,L _sErNcT,R Yb_yFtUeNsC)_;D E V| R ^~~E DOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :165134 | : 14 : note: cinitialize the variable 'dst' to silence this warningo pyTo S134h | m e m 8 (vtoiidd% W*AdRsPt_,S I*ZsEr,c ;d s t| , ^ s r| c = nullptr, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:9::154 :note: 10in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 5 | 154 | M ScCaCsLe_ I3M:P L _| K ^E RNEL_ENTR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cppY:_5FU:N9C:_ Dnote: Ein instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereV RED O5P | _ T Y P E ( S u mM,S CiCnLt_6I4M_PtL,_ KfEaRlNsEeL)_;E N T| R ^Y _FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:_399D:E3V:R Enote: Dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'O P_TYP E399( | S u mm,s cicnltR6u4n_Itn,t efraplrseet)e;r < t| y ^p e, Fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:c405#:#3d:e vnote: rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'e dop , mPsrcoctloRLuLn,I nftuelrlpOrpest>e(rc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 165P:r33o:t onote: Suninitialized use occurs herei mple <165M | S C C L _cCoHpUyNTKoSSThEmPeSm/8M(StCiCdL%_WSALRIPC_ESSITZEEP,S ,d sMtS,C CsLr_cS,L IbCyEtSeTsE)P;S > ,| ^~~f ullOps/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h>:(162c:o5m:m ,warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]a lg o162, | w o r kd)e;f a\u l t| : ^ | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h165::16533::33 :note: uninitialized use occurs herenote: uninitialized use occurs here 165165 | | ccooppyyTTooSShhmmeemm88((ttiidd%%WWAARRPP__SSIIZZEE,, ddsstt,, ssrrcc,, bbyytteess));; | | ^~~ ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ICESTEPS, MSCCL_SLICESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpIn file included from r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cppe:t1e: rIn file included from ,507 | P r o t otLiLd,( tfiudl)l,O pnst>h(rceoamdms,( natlhgroe,a dwso)r,k )w;i d\( t i| d ^% WARP_SIZE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h):,165 :w33a:r pnote: (uninitialized use occurs heret id/WA R165P | _ S I Z Ec)o,p y T| o ~~~~~~~~~~~~~~~~~~S h m| e stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)m 8(t i508d | % W A R Pw_aSrIpZIEn,B ldosctk,( tshrrce,a dbIydtxe.sx)/;W A R| P ^~~_ SIZE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h):,162 : 5| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]| warp(tid/WARP_SIZE 162 | 509 | d e f afullatg:T h r| e ^~~~~~~a d(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t165i:d33%:4 )note: =uninitialized use occurs here= 3), g165r | o u p ( gcroopuypT)o,S h m| e ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~m 8 (| t warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3i d%WARP _510S | I Z E , sdtsetp,S iszrec(,n cbcyltSehsm)e;m . c| o ^~~m m.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:r134o:t14o:, note: 0initialize the variable 'dst' to silence this warning> pri m134s | | ^ void/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp :*5d:s9t:, note: *in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested heres rc; 5| | ^ | = nullptr MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | intIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cppo:f1f: sIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht: 13=: In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hi:d168;: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :| 153 ^: 14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ dx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp::351:: In file included from warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hunused variable 'flag2' [-Wunused-variable]: 13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h153: | 168 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153u:i14n:t 3warning: 2unused variable 'data1' [-Wunused-variable]_ t data1, 153f | l a g 1 ,u idnatt3a22_,t fdlaatga21;, f| l ^~~~~a g1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ : note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSIn file included from C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cppC:L1_: IIn file included from M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:L13_: KIn file included from E/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hR:N169E: L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h_:E509N:T29R:Y _warning: Ffield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]U NC_DEVREDOP _507T | Y P E ( Stuimd,( thiadl)f,, nftahlrseea)d;s ( n| t ^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)405,: 3w:i dnote: (expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE't id%WARP _405S | I Z Em)s,c cwlaRrupn(Itnitde/rWpArRePt_eSrIr,p IPnrBoltoocSki(mtphlreed,% 4f)u=l=l3O)p,s >g(rcooumpm(,g raolugpo),, w o| r ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~k ) ;| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3\ | ^ 510 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165 :s33t:e pnote: Suninitialized use occurs herei ze(nc c165l | S h m e mc.ocpoymTmo.SbhumfefmS8i(zteisd[%NWCACRLP__PSRIOZTEO,_ LdLs1t2,8 ]s/rNcC,C Lb_yStTeEsP)S;/ s i| z ^~~e of(uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h6:4162_:t5):) warning: {variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~162 | | group(group default: | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ^~~~~~~: 217:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h57::165 :note: 33in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here: note: uninitialized use occurs here 217 | 165 | P r i mciotpiyvTeosSc,, 1b,y tPerso)t;o , | 0 ^~~> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :m134s:c14c:l Rnote: uinitialize the variable 'dst' to silence this warningn Int e134r | p r e t evro, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ erpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hds), ti:d386I:n9B:l owarning: cvariable 'wireOffset' set but not used [-Wunused-but-set-variable]k (threadIdx. x386) | , g r oiunpt( gwrioruepO)f,f s e| t ^~~~~~~~~~~ = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h1:: 162In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h5::13 : warning: In file included from variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :169 : 162/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h | : 509 : 29 :d ewarning: ffield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]a ult: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h507: | 165 : 33 : tnote: iuninitialized use occurs hered (tid )165, | n t h rceoapdysT(onSthhmreema8d(st)i,d %wWiAdR(Pt_iSdI%ZWEA,R Pd_sStI,Z Es)r,c ,w abrypt(etsi)d;/ W A| R ^~~P _SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 134 :s14t:e pnote: Sinitialize the variable 'dst' to silence this warningi ze(n c134c | l S h m evmo.icdo m*md.sbtu,f f*Ssirzce;s [ N| C ^C L _| P = nullptrR OTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ id(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll12In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 8Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread(In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr (tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 15 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp509::529::9 :warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5507 | | t i d (MtSiCdC)L,_ InMtPhLr_eKaEdRsN(EnLt_hErNeTaRdYs_)F,U NwCi_dD(EtViRdE%DWOAPR_PT_YSPIEZ(ES)u,m ,w arrcpc(lt_ibdf/lWoAaRtP1_6S,I ZfEa)l,s e )| ; ~~~~~~~~~~~~~~~~~~ | | ^ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 405 : 3 :w anote: rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'p InBlock (405t | h r emasdcIcdlxR.uxn/IWnAtRePr_pSrIeZtEe)r,< t y| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e , | F warp(tid/WARP_SIZEu nc# #509d | e v r e dfolpaa,d (P(rtoitdo%S4i)m=p=l3e)<,M SgCrCoLu_pC(HgUrNoKuSpT)E,P S /| M ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~S C C| L warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3_ SLICES T510E | P S , MsStCeCpLS_iSzLeI(CnEcScTlESPhSm>e,m .fcuolmlmO.pbsu>f(fcSoimzme,s [aNlCgCoL,_ PwRoOrTkO)_;L L\1 2 8| ] ^/ NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hT:E165P:S33/:s inote: zuninitialized use occurs heree of(ui n165t | 6 4 _ t )c)o p{y T o| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h m e| m group(group8 (tid%WARP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:S217I:Z57E:, note: din instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested heres t, sr c217, | b yPtreism)i;t i v| e ^~~s a,u l1t, Proto,: 0 >| ^~~~~~~p rim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs: 165 :| 33 ^: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp :1655 | : 9 : note: cin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereo pyToS h5m | e m 8 ( t i d % WMASRCPC_LS_IIZMEP,L _dKsEtR,N EsLr_cE,N TbRyYt_eFsU)N;C _ D| E ^~~V REDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dsIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr t, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1d: (In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hi:d10): ,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hn:t167h: r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15(:n twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eads), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~b u f| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S izes[ N563C | C L _ P RsOtTeOp_SSiIzMeP(LnEc]c/lNSChCmLe_mS.TcEoPmSm/.sbiuzfefoSfi(zTe)s)[ N{C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P R O| T group(groupO _SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h]:/33N:C7C:L _note: Sin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereT EPS/si z33e | o f ( T ) ) p{r i m| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t i| d group(group, nthreads, &ri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hn:g33-:>7p:r enote: vin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here, &ring -33> | n e x t , aprrgism-s>(steindd,b unftfh,r eaardgss,- >&rreicnvgb-u>fpfr,e va,r g&sr-i>nrge-d>OnpeAxrtg,, a0r,g sa-r>gsse-n>dcbounfnfI,n daerxg,s -a>rrgesc-v>bcuofnfn,I nadregxs)-;> r e| d ^O pArg,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :078,: 5a:r gnote: sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here- >co n78n | I n d e xr,u naRrignsg-<>Tc,o nRneIdnOdpe,x )P;r o t| o ^> (args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h):;78 : 5: note: | in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here ^ 78 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202r:u53n:R inote: nin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereg n(WaorrgksE)l;e m e| n ^t , 1, 2>::run' requested hereA lgo, 202P | r o t o > ( ) . rRuunn(Wwoer)k;E l e| m ^e nt, 1, 2>::run' requested here, Alg o6, | IPMrPoLt_oC>O(L)L._rFuUnN(Cw(eR)e;d u c| e ^S catter,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp :R4I:N1G:, note: Sin instantiation of member function 'RunWork, 1, 2>::run' requested hereI MPLE ,4 | SIuMmPPLo_sCtODLiLv_,F UiNnCt(3R2e_dtu)c e S| c^a tter/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391R:I95N:G ,note: expanded from macro 'IMPL_COLL_FUNC'S IMPLE, 391S | u m PRousntWDoirvk,< nicnctl8F_utn)c # #| f^u nc, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:y391p:e95,: Fnote: uexpanded from macro 'IMPL_COLL_FUNC'n c##dev r391e | d o pRk,< nNcCcClLF_uAnLcG#O#_f#u#nacl,g ot,y pNeC,C LF_uPnRcO#T#Od_e#v#rperdootpo<>t(y)p.er>u,n (N&CnCcLc_lASLhGmOe_m#.#waolrgko),; N\C C L| _ ^P ROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:p15r:o tnote: ofield 'nthreads' will be initialized after field 'tidInBlock'> ().ru n562( | & n c c ltSihdm(etmi.dw)o,r kn)t;h r\e a d| s ^( nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd InBl o562c | k ( tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:O562L:L15_:F Uwarning: Ninitializer order does not match the declaration order [-Wreorder-ctor]C (Reduce S562c | a t t e rt,i dR(ItNiGd,) ,S InMtPhLrEe,a dSsu(mnPtohsrteDaidvs,) ,u itnitd8I_ntB)l o c| k^( thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391I:d95x:. xnote: )expanded from macro 'IMPL_COLL_FUNC', group (391g | r o uRpu)n,W o r| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~< n c| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Func# #563f | u n c , sttyeppeS,i zFeu(nncc#c#ldSehvmreemd.ocpof,f SNiCzCeLs_[ANLCGCOL__#P#RaOlTgOo_,S INMCPCLLE_]P/RNOCTCOL__#S#TpErPoSt/os>i(z)e.orfu(nT()&)n c{c l S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| . group(groupw ork); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h\: 33 :| 7 ^: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 33note: | field 'nthreads' will be initialized after field 'tidInBlock' p562r | i m s ( ttiidd,( tnitdh)r,e andtsh,r e&ardisn(gn-t>hprreeavd,s )&,r itnigd-I>nnBelxotc,k (atrhgrse-a>dsIednxd.bxu)f,f ,g raorugps(-g>rroeucpv)b,u f f| , ^~~~~~~~~~~~~~~~~ arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:r60e:d Onote: pfield 'group' will be initialized after field 'stepSize'A rg, 0 ,562 | a r g s -t>icdo(ntniIdn)d,e xn,t harregasd-s>(cnotnhnrIenaddesx)),; t i| d ^I nBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hk:(78t:h5r:e anote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereI dx. x78) | , g r oruupn(Rgirnogu(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up), 563| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s tepSi z563e | ( n c c lsSthempeSmi.zceo(mnmc.cbluSfhfmSeimz.ecso[mNmC.CbLu_fPfRSOiTzOe_sS[INMCPCLLE_]P/RNOCTCOL__SSITMEPPLSE/]s/iNzCeCoLf_(STT)E)P S{/ s i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e o f| ( group(groupT )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :3333 | : 7 : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here prims(tid ,33 | n t h r e a dpsr,i m&sr(itnigd-,> pnrtehvr,e a&drsi,n g&-r>innegx-t>,p raervg,s -&>rsienngd-b>unfefx,t ,a ragrsg-s>-r>escevnbdubfuff,f ,a ragrsg-s>-r>erdeOcpvAbrugf,f ,0 ,a ragrsg-s>-r>ecdoOnpnAIrngd,e x0,, aarrggss-->>ccoonnnnIInnddeexx),; a r| g ^ s-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h>:c78o:n5n:I nnote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heree x); 78 | | ^ runR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hi:n78g:<5T:, note: Rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heree dOp ,78 | P r o t or>u(naRrignsg)<;T , | R ^e dOp, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:o202>:(53a:r gnote: sin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here) ; | ^202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:u202n:W53o:r knote: Ein instantiation of member function 'RunWorkElement, 1, 2>::run' requested herel emen t202< | F n , T , R eRduOnpW,o rAklEgloe,m ePnrto,( )T.,r uRne(dwOep),; A l| g ^o , Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cppo:>9(:)1.:r unote: nin instantiation of member function 'RunWork, 1, 2>::run' requested here( we); 9 | | I ^M PL_COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp_:F7U:N1C:( Rnote: ein instantiation of member function 'RunWork, 1, 2>::run' requested hered uceS c7a | tItMePrL,_ CROILNLG_,F USNICM(PRLeEd,u cSeuSmcPaotstteDri,v ,R IuNiGn,t 6S4I_MtP)L E ,| ^S umPos/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:D391i:v95,: unote: iexpanded from macro 'IMPL_COLL_FUNC'n t32_t) 391 | | ^ RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:r391k:<95n:c cnote: lexpanded from macro 'IMPL_COLL_FUNC'F unc##f u391n | c , RtuynpWeo,r kFp,e ,N CFCuLn_cA#L#GdOe_v#r#eadlogpo<,t yNpCeC>L,_ PNRCOCTLO__A#L#GpOr_o#t#oa>l(g)o.,r uNCnC(L&_nPcRcOlTSOh_m#e#mp.rwootrok>)(;) .\r u n| ( ^& ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562.:w15o:r knote: )field 'nthreads' will be initialized after field 'tidInBlock'; \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dnote: )field 'nthreads' will be initialized after field 'tidInBlock', nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . ^~~~~~~~~~~~~~~~~x ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u60p:( gnote: rfield 'group' will be initialized after field 'stepSize'o up), 562 | | ^~~~~~~~~~~~~~~~~ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i60d:) ,note: field 'group' will be initialized after field 'stepSize'n threa d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~x .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter ,154 | P r o t ocLaLs1e2 83,: f u| l ^l Ops>(comm, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cppl:g5o:,9 :w onote: rin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herek ); \ | 5 ^ | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = Wir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr eWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOpsIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ >(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInteIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr rpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(iPzreo(dn,c culiSnhtm6e4m_.tc,o mfma.lbsuef)f;S i z| e ^s [NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hL:_399P:R3O:T Onote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'L L128]/ N399C | C L _mSsTcEcPlSR/usniIznetoefr(purientte6r4<_tty)p)e ,{ F u| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c # #| d group(groupe vredop, ProtoL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hL:,217 :f57u:l lnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herep s>(comm, 217a | l g oP,r iwmoirtki)v;e s\< T ,| ^R edOp, FanAs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hy:m165m:e33t:r inote: cuninitialized use occurs here< 1,1>, 1165, | P r o tcoo,p y0T>o Sphrmiemms8 ( t| i ^d %WARP_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cppI:Z5E:,9 :d snote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here, src ,5 | b y t e s ) ; M| S ^~~C CL_IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:K162E:R5N:E Lwarning: _variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]E NTR Y162_ | F U N C _dDeEfVaRuElDtO:P _ T| Y ^~~~~~~P E(P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:o165d:,33 :u inote: nuninitialized use occurs heret 64_t ,165 | f a l s ec)o;p y T| o ^S hmem8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t402i:d3%:W Anote: Rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'P _SIZE ,402 | d s tm,s cscrlcR,u nbIyntteesr)p;r e t| e ^~~r , ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, alIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ go, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, ds/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hid),: 154n:t10h:r ewarning: avariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]d s(nthr e154a | d s ) , ctaisdeI n3B:l o c| k ^( threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp :g5r:o9u:p (note: gin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herer oup), 5 | | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :M60S:C Cnote: Lfield 'group' will be initialized after field 'stepSize'_ IMPL _562K | E R N E Lt_iEdN(TtRiYd_)F,U NnCt_hDrEeVaRdEsD(OnPt_hTrYePaEd(sP)r,o dt,i duIinnBtl8o_ctk,( tfharlesaed)I;d x .| x ^) , grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:(405g:r3o:u pnote: )expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE', | ^~~~~~~~~~~ 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here c c202l | F u n c # # f u nRcu,n WtoyrpkeE,l eFmuennct#<#Fdne,v rTe,d oRpeA,l gNoC,C LP_rAoLtGoO>_(#)#.arlugno(,w eN)C;C L _| P ^R OTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppp:r7o:t1o:> (note: )in instantiation of member function 'RunWork, 1, 2>::run' requested here. run( &7n | cIcMlPSLh_mCeOmL.Lw_oFrUkN)C;( R\e d u| c ^e Scatter, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:I562N:G15,: Snote: Ifield 'nthreads' will be initialized after field 'tidInBlock'M PLE, Su m562, | u i n tt3i2d_(tt)i d )| ,^ nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391a:d95s:( nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads )391, | t iRduInnWBolrokc60,: Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_ALGO _562# | # a l g ot,i dN(CtCiLd_)P,R OnTtOh_r#e#apdrso(tnot>h(r)e.ardusn)(,& ntcicdlISnhBmleomc.kw(otrhkr)e;a d\I d x| . ^x ), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p15(:g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p), | 562 ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppp:(1g: rIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hu:p10): ,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :| 167 ^~~~~~~~~~~~~~~~~: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::15562:: 60warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i d ( ttiid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d )N,C CnLt_hArLeGaOd_s#(#natlhgroe,a dNsC)C,L _tPiRdOITnOB_l#o#cpkr(otthor>e(a)d.Irduxn.(x&)n,c cglrSohumpe(mg.rwoourpk)),; \| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : note: sfield 'nthreads' will be initialized after field 'tidInBlock't epSiz e562( | n c c l Sthimde(mt.icdo)m,m .nbtuhfrfeSaidzse(sn[tNhCrCeLa_dPsR)O,T Ot_iSdIIMnPBLlEo]c/kN(CtChLr_eSaTdEIPdSx/.sxi)z,e ogfr(oTu)p)( g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | group(group| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:: 33note: :field 'group' will be initialized after field 'stepSize'7 : note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 562 | 33t | i d ( t i d )p,r inmtsh(rteiadd,s (nntthhrreeaaddss,) ,& rtiindgI-n>Bplroecvk,( t&hrrienagd-I>dnxe.xxt),, agrrgosu-p>(sgernodubpu)f,f , | a ^~~~~~~~~~~r gs->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'group' will be initialized after field 'stepSize': 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]562 | tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgro:u562p:)15,: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 562 | s tteipdS(itzied()n,c cnltShhrmeeamd.sc(onmtmh.rbeuafdfsS)i,z etsi[dNICnCBLl_oPcRkO(TtOh_rSeIaMdPILdEx]./xN)C,C Lg_rSoTuEpP(Sg/rsoiuzpe)o,f ( T| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) {| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hS:i33z:e7(:n cnote: cin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herel Shmem.co m33m | . b u f f S ipzreism[sN(CtCiLd_,P RnOtThOr_eSaIdMsP,L E&]r/iNnCgC-L>_pSrTeEvP,S /&sriiznego-f>(nTe)x)t ,{ a r| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s - >| s group(groupe ndbuff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hg:s33-:>7r:e cnote: vin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereb uff, arg s33- | > r e d O p Aprrgi,m s0(,t iadr,g sn-t>hcroenandIsn,d e&xr,i nagr-g>sp-r>ecvo,n n&Irnidnegx-)>;n e x| t ^, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hs:-78>:s5e:n dnote: bin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereu ff, 78a | r g s - >rruencRvibnugf,r ePdrOoptAor>g(,a r0g,s )a;r g s| - ^> connI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:d202e:x53,: anote: rin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereg s->c o202n | n I n d e x ) ; R u| n ^W ork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hE:l78e:m5e:n tnote: , ProtoSimple<2, 2>>' requested hereF n, T78, | R e d Orpu,n RAilnggo<,T ,P rRoetdoO>p(,) .Prruont(ow>e()a;r g s| ) ^; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202in instantiation of member function 'RunWork, 1, 2>::run' requested here: 53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here6 | IMP L202_ | C O L L _ F U N CR(uRneWdourckeESlceamtetnetr<,F nR,I NTG,, RSeIdMOPpL,E ,A lSguom,, Pirnott3o2>_(t)). r u| n^( we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp :13:1: note: 391in instantiation of member function 'RunWork, 1, 2>::run' requested here | Ru n13W | oIrMkPu,m ,N CrCcLc_lA_LbGfOl_o#a#ta1l6g)o , | N^C CL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:O391T:O95_:# #note: pexpanded from macro 'IMPL_COLL_FUNC'r oto>() .391r | u n (R&unncWcolrSkhi,d (NtCiCdL)_,A LnGtOh_r#e#aadlsg(on,t hNrCeCaLd_sP)R,O TtOi_d#I#npBrlootcok>((t)h.rreuand(I&dnxc.cxl)S,h mgermo.uwpo(rgkr)o;u p\) , | ^| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6015:: note: note: field 'group' will be initialized after field 'stepSize'field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : warning: Rinitializer order does not match the declaration order [-Wreorder-ctor]u nWorkElem e562n | t < F n ,t iTd,( tRiedd)O,p ,n tAhlrgeoa,d sP(rnotthor>e(a)d.sr)u,n (twied)I;n B l| o ^c k(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppe:a13d:I1d:x .note: xin instantiation of member function 'RunWork, 1, 2>::run' requested here) , g r13o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C ( R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d uceSca t563t | e r , RsItNeGp,S iSzIeM(PnLcEc,l SShumme,m .rccocmlm_.bbfulfofaSti1z6e)s [ N| C^C L_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T391O:_95S:I Mnote: Pexpanded from macro 'IMPL_COLL_FUNC'L E]/NC C391L | _ S TREuPnSW/osrikz, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereo p, 33N | C C L _ A L GpOr_i#m#sa(ltgiod,, NnCtChLr_ePaRdOsT,O _&#r#ipnrgo-t>op>r(e)v.,r u&nr(i&nngc-c>lnSehxmte,m .awrogrsk-)>;s e\n d b| u ^f f, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>15r:e cnote: vfield 'nthreads' will be initialized after field 'tidInBlock'b uff, a562r | g s - > rteiddO(ptAirdg),, 0n,t harregasd-s>(cnotnhnrIenaddesx),, atrigdsI-n>BclooncnkI(ntdherxe)a;d I d| x ^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ho/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:u:78p562:(:5g15:r: o note: uwarning: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested herepinitializer order does not match the declaration order [-Wreorder-ctor] ) , 78 | | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h r: u562 n: R60ti:in dgnote: (it(dha)rr,eg asnd)ts;h) r, e | at ^di sd(InntBh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hlr:oe202ca:kd53(s:t) h,note: r in instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereet aiddII dn202xB | .l xo )c ,k ( gt rh or ueRpau(dngIWrdooxru.kpxE))l,,e m ge| rn ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ot u< pF| (n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)g, r oTu ,p563 ) | R, e d O | ps ^~~~~~~~~~~,t eAplSgioz,e (PnrcoctloS>h(m)e.mr.ucno(mwme.)b;u f f| S ^i zes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppL:_12P:R1O:T Onote: _in instantiation of member function 'RunWork, 1, 2>::run' requested hereS IMPL E12] | /INMCPCLL__CSOTLELP_SF/UsNiCz(eRoefd(uTc)e)S c{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ atter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 386N | C C L _ AiLnGtO _w#i#raelOgfof,s eNtC C=L _WPiRrOeTWOo_r#d#PperroStloi>c(e)*.wraurnp( &+n c2c*lwSihdm;e m .| w ^o rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 1, 2>::run' requested heren c, type, 202F | u n c # # d e v rReudnoWpom,e nNtC_(#)#.prruont(ow>e());. r u| n ^( &ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cppr:k7):;1 :\ note: in instantiation of member function 'RunWork, 1, 2>::run' requested here| ^ 7 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:_15C:O Lnote: Lfield 'nthreads' will be initialized after field 'tidInBlock'_ FUNC( R562e | d u c e Stciadt(tteird,) ,R InNtGh,r eSaIdMsP(LnEt,h rPeraodds,) ,u itnitd3I2n_Btl)o c k| (^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d391x:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup(g r391o | u p )R,u n W| o ^~~~~~~~~~~~~~~~~r kh,r eNaCdCsL)_,A LtGiOd_I#n#Ballogcok,( tNhCrCeLa_dPIRdOxT.Ox_)#,# pgrrootuop>((g)r.oruupn)(,& n c| c ^~~~~~~~~~~l Shmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:: 562note: :in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]202 | RunWork 562 | E l etmiedn(tti(d)I.nrBulno(cwke()t;h r e| a ^d Idx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp):,11 :g1r:o unote: pin instantiation of member function 'RunWork, 1, 2>::run' requested here( gro u11p | )I,M P L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C O L| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ FUNC(R e563d | u c e S csattetpeSri,z eR(InNcGc,l SShImMePmL.Ec,o mPmr.obdu,f ffSliozaets)[ N C| C^L _PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:O391_:S95I:M Pnote: Lexpanded from macro 'IMPL_COLL_FUNC'E ]/NCC L391_ | S T ERPuSn/Wsoirzkenote: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here NCCL_AL G33O | _ # # a l g op,r iNmCsC(Lt_iPdR,O TnOt_h#r#epardost,o >&(r)i.nrgu-n>(p&rnecvc,l S&hrmienmg.-w>onrekx)t;, \a r g| s ^- >sen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:b562u:f15f:, note: afield 'nthreads' will be initialized after field 'tidInBlock'r gs-> r562e | c v b u ftfi,d (atrigds)-,> rnetdhOrpeAardgs,( n0t,h raeragdss-)>,c otnindIInndBelxo,c ka(rtghsr-e>acdoIndnxI.nxd)e,x )g;r o u| p ^( group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h : 78| : ^~~~~~~~~~~~~~~~~5 : note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here: 562:60 :78 | note: field 'group' will be initialized after field 'stepSize' ru n562R | i n g < Tt,i dR(etdiOdp),, Pnrtohtroe>a(dasr(gnst)h;r e a| d ^s ), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l202o:c53k:( tnote: hin instantiation of member function 'RunWorkElement, 1, 2>::run' requested herer eadI d202x | . x ) , g r o uRpu(ngWroorukpE)l,e m e| n ^~~~~~~~~~~t ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r15k:) ;warning: initializer order does not match the declaration order [-Wreorder-ctor]\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( group )563, | | ^~~~~~~~~~~~~~~~~ step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60(:n cnote: cfield 'group' will be initialized after field 'stepSize'l Shmem .562c | o m m . btuifdf(Stiizde)s,[ NnCtChLr_ePaRdOsT(On_tShIrMePaLdEs])/,N CtCiLd_ISnTBElPoSc/ks(itzheroefa(dTI)d)x .{x ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp (group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h):,33 : 7| : ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWs); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgroup):, 562 :| 15 ^~~~~~~~~~~: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h167:: 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::15562:: 15warning: :initializer order does not match the declaration order [-Wreorder-ctor] warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t e psStiezpeS(inzcec(lnSchcmleSmh.mceomm.mc.obmumf.fbSuifzfeSsi[zNeCsC[LN_CPCRLO_TPOR_OSTIOM_PSLIEM]P/LNEC]C/LN_CSCTLE_PSST/EsPiSz/esoifz(eTo)f)( T{) ) | { ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here: 33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | 33 | p r i m s ( tpirdi,m sn(tthirde,a dnst,h r&eraidnsg,- >&prrienvg,- >&prrienvg,- >&nreixntg,- >anregxst-,> saerngdsb-u>fsfe,n dabrugfsf-,> raercgvsb-u>frfe,c vabrugfsf-,> raerdgOsp-A>rrge,d O0p,A ragr,g s0-,> caorngnsI-n>dceoxn,n Ianrdgesx-,> caorngnsI-n>dceoxn)n;I n d| e ^x ); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h78: | 78 : 5 : rnote: uin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heren Ring <78T | , R e drOupn,R iPnrgo (RaerdgOsp),; P r| o ^t o>(a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g202s:)53;: note: | in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: Rin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereu nWo r202k | E l e m e n t < FRnu,n WTo,r kREeldeOmpe,n tAO(p),. rAulng(ow,e )P;r o t| o ^> ().run(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cppw:e5):;1 : | note: ^in instantiation of member function 'RunWork, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp :54 | :I1M:P Lnote: _in instantiation of member function 'RunWork, 1, 2>::run' requested hereC OLL _4F | UINMCP(LR_eCdOuLcLe_SFcUaNtCt(eRre,d uRcIeNSGc,a tStIeMrP,L ER,I NMGa,x ,S IuMiPnLtE8,_ tM)a x ,| ^i nt8_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 391 :| 95^: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC'391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->reIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)15 :{ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hi:d33):,7 :n tnote: hin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herer eads(n t33h | r e a d s ) ,p rtiimdsI(ntBildo,c kn(tthhrreeaaddsI,d x&.rxi)n,g -g>rporuepv(,g r&oruipn)g,- > n| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~x t ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rgs-> s563e | n d b u fsft,e paSrigzse-(>nrcecclvSbhumfefm,. caormgms.-b>urfefdSOipzAersg[,N C0C,L _arPgRsO-T>Oc_oSnInMIPnLdEe]x/,N CaCrLg_sS-T>EcPoSn/nsIinzdeeoxf)(;T ) )| ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :| 78 group(group: 5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :33: 778: | note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here run R33i | n g < T , RperdiOmps,( tPirdo,t on>t(harregasd)s;, &| r ^i ng->prev, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:r202i:n53g:- >note: nin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heree xt, a202r | g s - > s e n d bRuufnfW,o args->recvbuff,r kaErlgesm-e>nrtegcoo,n nPIrnodteox>,( )a.rrgusn-(>wceo)n;n I n| d ^e x); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:: 78note: :in instantiation of member function 'RunWork, 1, 2>::run' requested here5 : note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here9 | IM P78L | _ C O L Lr_uFnURNiCn(gRN(Ga,r gSsI)M;P L E| , ^ Max, u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:n202t:6534:_ tnote: )in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : Rnote: uexpanded from macro 'IMPL_COLL_FUNC'n WorkEl e391m | e n tRe(,) .Fruunnc(#w#ed)e;v r e| d ^o p:,9 :N1C:C Lnote: _in instantiation of member function 'RunWork, 1, 2>::run' requested hereA LGO _9# | #IaMlPgLo_,C ONLCLC_LF_UPNRCO(TROe_d#u#cperSoctaot>t(e)r.,r uRnI(N&Gn,c cSlISMhPmLeEm,. wMoarxk,) ;u i\n t 6| 4 ^_ t) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 391note: :field 'nthreads' will be initialized after field 'tidInBlock'95 : note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t i dR(utniWdo)r,k ),, NgCrCoLu_pA(LgGrOo_u#p#)a,l g o| , ^~~~~~~~~~~~~~~~~ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O60T:O _note: #field 'group' will be initialized after field 'stepSize'# proto> (562) | . r u n (t&indc(ctliSdh)m,e mn.twhorreka)d;s (\n t h| r ^e ads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bnote: lfield 'nthreads' will be initialized after field 'tidInBlock'o ck(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~e ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d (tid), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . x| ) ^~~~~~~~~~~~~~~~~, g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60(:g rnote: ofield 'group' will be initialized after field 'stepSize'u p), | 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(t i563d | ) , n tshtreepaSdisz(en(tnhcrcelaSdhsm)e,m .tciodmImn.BbluofcfkS(itzherse[aNdCICdLx_.PxR)O,T Og_rSoIuMpP(LgEr]o/uNpC)C,L _ S| T ^~~~~~~~~~~E PS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' oto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hc:o386m:m9.:b uwarning: fvariable 'wireOffset' set but not used [-Wunused-but-set-variable]f Size s386[ | N C C L _iPnRtO TwOi_rLeLO1f2f8s]e/tN C=C LW_iSrTeEWPoSr/dsPiezreSolfi(cuei*nwta6r4p_ t+) )2 *{w i d| ; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hunused variable 'data1' [-Wunused-variable] :134:14: note: initialize the variable 'dst' to silence this warning 153134 | | uvionitd3 2*_dts td,a t*as1r,c ;f l a| g ^1 , | d = nullptra ta2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:d154s:)10,: twarning: ivariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]d InB l154o | c k ( t hcraesaed I3d:x . x| ) ^, group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp(:g5r:o9u:p )note: ,in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 5 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | 563 | M S CsCtLe_pISMiPzLe_(KnEcRcNlESLh_mEeNmT.RcYo_mFmU.NbCu_fDfESViRzEeDsO[PN_CTCYLP_EP(RMOaTxO,_ SuIiMnPtL3E2]_/tN,C CfLa_lSsTeE)P;S / s| i ^z eof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hT:)402): 3{: note: | expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 402 | msc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:l217R:u57n:I nnote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested heree rpret e217r | < t yPprei,m iFtuinvce#s#A,s yPmrmoettorLiLc1<218,,1 >f,u l1l,O pPsr>o(tcoo,m m0,> aplrgiom,s w o| r ^k ); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp| : ^5 :9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :note: 165in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here: 33: note: uninitialized use occurs here5 | 165 | M ScCoCpLy_TIoMSPhLm_eKmE8R(NtEiLd_%EWNATRRPY__SFIUZNEC,_ DdEsVtR,E DsOrPc_,T YbPyEt(eMsa)x;, u| i ^~~n t32_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:,162 :f5a:l swarning: evariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]) ; 162| | ^ d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:f405a:u3l:t :note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165 :40533 | : note: muninitialized use occurs heres ccl R165u | n I n t ecroppryeTtoeSrhr,c ,P rboyttoeSsi)m;p l e| < ^~~M SCCL_CHUNKSTEPS/MSCCL_SLICESTEPS, MSCCL_SLICESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:I134n:B14l:o cnote: kinitialize the variable 'dst' to silence this warning( thr e134a | d I d x .vxo)i,d g*rdosutp,( g*rsorucp;) , | ^| ^~~~~~~~~~~~~~~~~ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flagIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hIn file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp165::133: :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :note: 154uninitialized use occurs here: 10: warning: 165variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] | co p154y | T o S h mceams8e( t3i:d % W| A ^R P_SIZE, dst, src/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp,: 5b:y9t:e snote: )in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here; | ^~~ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpretero,i dP r*odtsotL,L ,* sfrucl;l O p| s ^> ( c| o = nullptrm m, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, srIn file included from c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp,: 1b: yIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:s13): ;In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :| 167 ^~~: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:m134.:b14u:f fnote: Sinitialize the variable 'dst' to silence this warningi zes[ N134C | C L _ P RvOoTiOd_ S*IdMsPtL,E ]*/sNrCcC;L _ S| T ^E P S| / = nullptrs izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd(tid:)154,: 10n:t hwarning: revariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]a ds(nth r154e | a d s ) ,c atsied I3n:B l o| c ^k (threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cppx:)5,: 9g:r onote: uin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herep (gro u5p | ) , | ^~~~~~~~~~~ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.him:p154l:e10<:M Swarning: Cvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]C L_CHU N154K | S T E P Sc/aMsSeC C3L:_ S L| I ^C ESTEPS, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cppM:S5C:C9L:_ Snote: Lin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereI CESTE P5S | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h165: | 386 : 9 : cwarning: ovariable 'wireOffset' set but not used [-Wunused-but-set-variable]p yToSh m386e | m 8 ( t iidn%tW AwRiPr_eSOIfZfEs,e td s=t ,W isrrecW,o rbdyPteersS)l;i c e| * ^~~w arp /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h+: 1622:*5w:i dwarning: ;variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] | ^162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter:,10 :P rwarning: ovariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]t oLL ,154 | f u l l Ocpass>e( c3o:m m ,| ^a lgo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppw:o5r:k9):; note: \in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here | ^ 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165 : 33 :M Snote: Cuninitialized use occurs hereC L_IMPL_ K165E | R N E L _cEoNpTyRTYo_SFhUmNeCm_8D(EtViRdE%DWOAPR_PT_YSPIEZ(EM,a xd,s ti,n ts3r2c_,t ,b yftaelss)e;) ; | ^~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h3::162 :note: 5expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE': warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 402 | 162 | m s c c ldReufnaIunltte:r p r| e ^~~~~~~t ero,S hPmreomt8o(LtLi1d2%8W,A RfPu_lSlIOZpEs,> (dcsotm,m ,s racl,g ob,y tweosr)k;) ; | \ ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1 386: | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 13 : iIn file included from n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.ht: 168w: i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hr:e153O:f14f:s ewarning: tunused variable 'data1' [-Wunused-variable] = WireWor d153P | e r S l iucien*tw3a2r_pt +d a2t*aw1i,d ;f l a| g ^1 , data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr o, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::33134: :note: 14uninitialized use occurs here: note: initialize the variable 'dst' to silence this warning 165 | c o134p | y T o S hvmoeimd8 (*tdisdt%,W A*RsPr_cS;I Z E| , ^ d s| t = nullptr, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:i134d:I14n:B lnote: oinitialize the variable 'dst' to silence this warningc k(thr e134a | d I d x .vxo)i,d g*rdosutp,( g*rsorucp;) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | = nullptr| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr EVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ oShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ^ :154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :154402 | : 3 : note: cexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'a se 3: | 402 ^ | msccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cppR:u5n:I9n:t enote: rin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herep reter <5t | y p e , F u n cM#S#CdCeLv_rIeMdPoLp_L,_ EPNrToRtYo_LFLU1N2C8_,D EfVuRlElDOOpPs_>T(YcPoEm(mM,a xa,l gion,t 8w_otr,k )f;a l\s e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSize | s[ NC C L _sPtReOpTSOi_zLeL(1n2c8c]l/SNhCmCeLm_.ScToEmPmS./bsuifzfeSoifz(eusi[nNtC6C4L__tP)R)O T{O _ S| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| E group(group] /NCCL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:S217/:s57i:z enote: oin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested heref (T)) 217{ | | P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r i m| i group(groupt ives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested herec <1,1>, 1217, | P rPortiom,i t0i>v epsr, ProtoLL128, false>' requested herei c<1,1 >5, | 1 , P r o t oM,S C0C>L _pIrMiPmLs_ K E| R ^N EL_ENTRY/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp_:F5U:N9C:_ Dnote: Ein instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereV REDO P5_ | T Y P E ( M a x ,M SiCnCtL8__ItM,P Lf_aKlEsReN)E;L _ E| N ^T RY_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hU:N402C:_3D:E Vnote: Rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'E DOP_T Y402P | E ( Mmasxc,c liRnutn8I_ntt,e rfparlestee)r;< t y| p ^e , Fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:#405#:d3e:v rnote: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd op ,m sPcrcoltRouLnLI1n2t8e,r pfrueltleOrpy(pceo,m mF,u nacl#g#od,e vwroerdko)p;< t\y p e| > ^, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp::5141:: 9In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :warning: 13variable 'offset' set but not used [-Wunused-but-set-variable]: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169 : 514/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h | : 271 : 19 :i nwarning: tunused variable 'ptr' [-Wunused-variable] offset 271= | t i d ; | ^u int64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp : 1s: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:p154S:i10z:e (warning: nvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]c clShm e154m | . c o m mc.absuef f3S:i z e| s ^[ NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cppS:I5M:P9L:E ]note: /in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested hereN CCL _5S | T E P S / s i z eMoSfC(CTL)_)I M{P L _| K ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E R N| E group(groupL _ENTRY_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hU:N217C:_57D:E Vnote: Rin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereE DOP_TY P217E | ( M aPxr,i mdiotuibvlees,< Tf,a lRseed)O;p , | F ^a nAs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hy:m399m:e3t:r inote: cexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'< 1,1> ,399 | 1 , mPsrcoctloR,u n0I>n tperripmrse t e| r ^< type, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cppF:u5n:c9: note: ##devredop, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr VREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h ^~~~~: 514:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h :warning: 153variable 'offset' set but not used [-Wunused-but-set-variable]: 28: warning: unused variable 'data2' [-Wunused-variable]514 | 153 | i n t oufifnste3t2 _=t tdiadt;a 1 ,| ^f lag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uintIn file included from 6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp4:_1t: *In file included from ptr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :=13 : rIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hc:v169P: t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hr:(2710:)19+:l lwarning: 1unused variable 'ptr' [-Wunused-variable]2 8Offset ;271 | | ^~~ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h*:w386a:r9p: +warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable]2 *wid; | 386 ^ | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ ^~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::165165::3333:: note: note: uninitialized use occurs hereuninitialized use occurs here 165 | 165 | c ocpoypTyoTSohSmhemme8m(8t(itdi%dW%AWRAPR_PS_ISZIEZ,E ,d sdts,t ,s rscr,c ,b ybtyetse)s;) ; | ^~~| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr EL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr readIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, In file included from R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cppI:N1G: ,In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hS:I10M: PIn file included from L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hE:,167 : P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562M:u15l:S uwarning: minitializer order does not match the declaration order [-Wreorder-ctor], int32_t) 562 | | ^ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391(:t95i:d )note: ,expanded from macro 'IMPL_COLL_FUNC' nthrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _ALGO _563# | # a l g os,t eNpCSCiLz_eP(RnOcTcOl_S#h#mpermo.tcoo>m(m)..bruufnf(S&inzcecsl[SNhCmCeLm_.PwRoOrTkO)_;S I\M P L| E ^] /NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T15E:P Snote: /field 'nthreads' will be initialized after field 'tidInBlock's izeof (562T | ) ) { t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t i| d group(group) , nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.ht:h33r:e7a:d snote: )in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here, tidIn B33l | o c k ( t h rperaidmIsd(xt.ixd),, ngtrhoruepa(dgsr,o u&pr)i,n g -| > ^~~~~~~~~~~~~~~~~p re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hv:,562 :&60r:i nnote: gfield 'group' will be initialized after field 'stepSize'- >nex t562, | a r g st-i>ds(etniddb)u,f fn,t harregasd-s>(rnetchvrbeuafdfs,) ,a rtgisd-I>nrBeldoOcpkA(rtgh,r e0a,d Iadrxg.sx-)>,c ognrnoIunpd(egxr,o uapr)g,s - >| c ^~~~~~~~~~~o nnIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herep Size(n c33c | l S h m e m .pcroimmms.(btuifdf,S inztehsr[eNaCdCsL,_ P&RrOiTnOg_-S>IpMrPeLvE,] /&NrCiCnLg_-S>TnEePxSt/,s iazregosf-(>Ts)e)n d{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hv:b33u:f7f:, note: ain instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herer gs->re d33O | p A r g , 0p,r iamrsg(st-i>dc,o nnntIhnrdeeaxd,s ,a r&grsi-n>gc-o>npnrIenvd,e x&)r;i n g| - ^> next, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hr:g78s:-5>:s enote: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hered buff ,78 | a r g s -r>urneRcivnbgu rPerdoOtpoA>r(ga,r g0s,) ;a r g| s ^- >conn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:n202d:e53x:, note: ain instantiation of member function 'RunWorkElement, 1, 2>::run' requested herer gs-> c202o | n n I n d e x ) ;R u n| W ^o rkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hm:e78n:t5<:F nnote: ,in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here T, R78e | d O p , rAulngRoi,n gPe(d)O.pr,u nP(rwoet)o;> ( a| r ^g s); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp ^: 5:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :in instantiation of member function 'RunWork, 1, 2>::run' requested here202 :53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here5 | IMP L202_ | C O L L _ F U N CR(uRneWdourckeESlceamtetnetr<,F nR,I NTG,, RSeIdMOPpL,E ,A lPgroe,M Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ype>, NCuClLS_uAmL,G Ou_i#n#ta8l_gto), N| C^C L_PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:p391r:o95t:o >note: (expanded from macro 'IMPL_COLL_FUNC') .run(&nc c391l | S h mReumn.Wwoorrkk<)n;c c\l F u| n ^c ##func, typ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :F15u:n cnote: #field 'nthreads' will be initialized after field 'tidInBlock'# devredo p562< | t y p e >t,i dN(CtCiLd_)A,L GnOt_h#r#eaaldgso(,n tNhCrCeLa_dPsR)O,T Ot_i#d#IpnrBoltooc>k(()t.hrruena(d&Indcxc.lxS)h,m egmr.owuopr(kg)r;o u\p ) ,| ^ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56260::15 :note: field 'group' will be initialized after field 'stepSize'note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:A15r:g ,warning: initializer order does not match the declaration order [-Wreorder-ctor]0 , args-> c562o | n n I n dteixd,( tairdg)s,- >nctohnrneIanddse(xn)t;h r e| a ^d s), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hd:I78n:B5l:o cnote: kin instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here( thre a78d | I d x . xr)u,n Rgirnogu tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( args) ;563 | | ^ stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e202(:n53c:c lnote: Sin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereh mem. c202o | m m . b u f f S iRzuensW[oNrCkCELl_ePmReOnTtO<_FSnI,M PTL,E ]R/eNdCOCpL,_ SATlEgPoS,/ sPirzoetoof>((T)).)r u{n ( w| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp33::107::1 :note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herenote: in instantiation of member function 'RunWork, 1, 2>::run' requested here 1033 | | I M P L _ C OpLrLi_mFsU(NtCid, (nRtehdruecaedSsc,a t&treirn,g -R>IpNrGe,v ,S I&MrPiLnEg,- >PnreexMtu,l Saurmg,s->sendb uhfafl,f )a r g| s^- >recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hb:u391f:f95,: anote: rexpanded from macro 'IMPL_COLL_FUNC'g s->redO p391A | r g ,R u0n,W oarrkgccloFnunnIcn#d#efxu,n ca,r gtsy-p>ec,o nFnuInncd#e#xd)e;v r e| d ^o p:,78 :N5C:C Lnote: _in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested hereA LGO _78# | # a l g or,u nNRCiCnLg_t(o)>.(raurng(s&)n;c c l| S ^h mem.work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;202 :\53 : | note: ^in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' RunWo r562k | E l e m etnitd<(Ftni,d )T,, nRtehdrOepa,d sA(lngtoh,r ePardost)o,> (t)i.drIunnB(lwoec)k;( t h| r ^e adIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp):,10 :g1r:o unote: pin instantiation of member function 'RunWork, 1, 2>::run' requested here( grou p10) | ,I M P| L ^~~~~~~~~~~~~~~~~_ COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:F562U:N60C:( Rnote: efield 'group' will be initialized after field 'stepSize'd uceSc a562t | t e r , tRiIdN(Gt,i dS)I,M PnLtEh,r ePardesM(unltShurme,a dhsa)l,f )t i d| I^n Bloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:(391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC'I dx.x) ,391 | g r oRuupn(Wgorroku, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g roup), | 562 ^~~~~~~~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:e562x:t15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->sendbu f562f | , a r gtsi-d>(rteicdv)b,u fnft,h raeragdss-(>nrtehdrOepaAdrsg),, 0t,i daIrngBsl-o>ccko(ntnhIrnedaedxI,d xa.rxg)s,- >gcroonunpI(ngdreoxu)p;) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78: 5563: | note: stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | intIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cppw:i1r: eIn file included from O/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:f13s: eIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :=169 : W/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hi:r271e:W19o:r dwarning: Punused variable 'ptr' [-Wunused-variable]e rSlice *271w | a r p + 2 * wuiidn;t 6 4| _ ^t * ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hpSi:z154e:(10n:c cwarning: lvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]S hmem.com m154. | b u f f Sciazsees [3N:C C L| _ ^P ROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp]:/5N:C9C:L _note: Sin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereT EPS/s i5z | e o f ( T ) ) {M S C| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L _ I| M group(groupP L_KERNEL_ENTRY_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hN:C217_:D57E:V Rnote: Ein instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereD OP_TY P217E | ( P rPordi,m irtcicvle_sb402,: 31:, note: Pexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'r oto, 0 >402 | p r immssc c l| R ^u nInterp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cppr:e5t:e9r:< tnote: yin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herep e, F u5n | c # # d e v r e dMoSpCP,L _PKrEoRtNoELLL_1E2N8T,R Yf_uFlUlNOCp_sD>E(VcRoEmDmO,P _aTlYgPoE,( Pwroordk,) ;r c\c l _| b ^f loat1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h6:,165 :f33a:l snote: euninitialized use occurs here) ; | ^ 165 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:o405p:y3T:o Snote: hexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'm em8(tid %405W | A R Pm_sScIcZlER,u ndIsntt,e rsprrce,t ebrywarning: ,variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] Pro t162o | S i m p ldeeP,_ SfIuZlEl,O pdss>t(,c osmrmc,, ablygtoe,s )w;o r k| ) ^~~; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:d134s:)14,: tnote: iinitialize the variable 'dst' to silence this warningd InBlock( t134h | r e a d Ivdoxi.dx )*,d sgtr,o u*ps(rgcr;o u p| ) ^, | | = nullptr ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flagIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid;In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::154402::103:: warning: note: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402154 | | m s cccalsReu n3I:n t e| r ^p reter, ProtoSimple<2, 2>, false>' requested here# #devr e5d | o p < t y p e > ,M SPCrCoLt_oILMLP1L2_8K,E RfNuElLl_OEpNsT>R(Yc_oFmUmN,C _aDlEgVoR,E DwOoPr_kT)Y;P E\( M a| x ^, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),In file included from tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cppn:B1l: o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:k154(:t10h:r ewarning: avariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]d Idx.x), g154r | o u p ( gcraosuep )3,: | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cppS:i5z:e9(:n cnote: cin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested herel Shme m5. | c o m m . b u f fMSSiCzCeLs_[INMCPCLL__KPERRONTEOL__SEINMTPRLYE_]F/UNNCCC_LD_ESVTREEPDSO/Ps_iTzYePoEf((MTa)x), {u i n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~8 _ t| , group(group false); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :| 217 ^: 57: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here: 399:3: 217note: | expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' Prim i399t | i v emss#,# d1e,v rPerdootpo<,t y0p>e >p,r iPmrso t o| L ^L , ful/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cppl:O5p:s9>:( cnote: oin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herem m, a l5g | o , w o r k ) ;M S\C C L| _ ^I MPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hK:E165R:N33E:L _note: Euninitialized use occurs hereN TRY_ F165U | N C _ D EcVoRpEyDTOoPS_hTmYePmE8((Mtaixd,% WuAiRnPt_8S_ItZ,E ,f adlsste,) ;s r c| , ^ byte/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)405;: 3 :| ^~~note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 162 :m5s:c cwarning: lvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]R unI n162t | e r p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h r: e154dt:ee10fr:a< utwarning: lyvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]tp :e , | F154 ^~~~~~~u | n c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h# :# 165dc:ea33vs:re e note: d3uninitialized use occurs hereo: p < t| y ^165p | e > , Pcr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cppoo:pt5yo:TS9oi:Sm hpnote: mlin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereee m<8M( St5Ci | Cd L% _W CA HR UP N_ KS SIMTZSEECP,CS L/d_MsIStMC,PC LLs__rKScEL,RI NCbEEyLSt_TeEEsNP)TS;R, Y _M| FS ^~~UC NCCL__DSELVIRCEEDSOTPE_PTSY>P,E (fMualxl,O pusi>n(tc8o_mtm,, faallgsoe,) ;w o r| k ^) ; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ^405 :3: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE': 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 405 | m s562c | c l R u ntIindt(etripdr)e,t enrtc,k (PtrhorteoaSdiImdpxl.e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hx<:)M134,S: C14gC:rL o_note: uCinitialize the variable 'dst' to silence this warningpH (UgNr Ko134Su | Tp E) P, S / vM| oS ^~~~~~~~~~~~~~~~~iC dC L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h*_:dS562sL:tI60,C:E S*note: Tsfield 'group' will be initialized after field 'stepSize'Er PcS;, 562M| | S ^ C C L| _ = nullptrtS iLdI(CtEiSdT)E,P Sn>t,h rfeualdlsO(pnst>h(rceoamdms,) ,a ltgiod,I nwBolrokc)k;( t\h r e| a ^d Idx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hx:)165,: 33g:r onote: uuninitialized use occurs herep (gro u165p | ) , | ^~~~~~~~~~~c opyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 15 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloaIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ t16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr oto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRYIn file included from _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cppF:U1N: CIn file included from _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hD:E13V: RIn file included from E/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hD:O169P: _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hT:Y509P:E29(:M awarning: xfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor], rccl_bf l507o | a t 1 6 ,t ifda(ltsied));, n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:n402t:h3r:e anote: dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE's ), wi d402( | t i dm%sWcAcRlPR_uSnIIZnEt)e,r pwraertpe(rt, P508r | o t o L Lw1a2r8p,I nfBullolcOkp(st>h(rceoamdmI,d xa.lxg/oW,A RwPo_rSkI)Z;E )\, | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmeIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp8:(1ti: d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h%:W154A:R10P:_ Swarning: Ivariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]Z E, dst, sr c154, | b y t ecsa)s;e 3| : ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp :d5e:f9a:u lnote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h5: | 165 : 33 : note: uninitialized use occurs here MS C165C | L _ I M PcLo_pKyETRoNSEhLm_eEmN8T(RtYi_dF%UWNACR_PD_ESVIRZEED,O Pd_sTtY,P Es(rMca,x ,b yrtcecsl)_;b f l| o ^~~a t16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 15 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 15 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 15 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h5: | 154 : 10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] MSCC L154_ | I M P L _cKaEsReN E3L:_ E N| T ^R Y_FUNC_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cppE:V5R:E9D:O Pnote: _in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereT YPE(Mi n5, | u i n t 8 _ t ,M SfCaClLs_eI)M;P L _| K ^E RNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hL:_402E:N3T:R Ynote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'F UNC_ D402E | V R EmDsOcPc_lTRYuPnEI(nMtienr,p rueitnetr8<_tty,p ef,a lFsuen)c;# # d| e ^v redop/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h<:t402y:p3e:> ,note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'P rotoLL 14022 | 8 , mfsuclcllORpusn>I(nctoemrmp,r eatlegro<,t ywpoer,k )F;u n\c # #| d ^e vredop, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, ds/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154 :510 | : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] M S154C | C L _ I McPaLs_eK E3R:N E L| _ ^E NTRY_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cppN:C5_:D9E:V Rnote: Ein instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereD OP_T Y5P | E ( M i n , i nMtS3C2C_Lt_,I MfPaLl_sKeE)R;N E L| _ ^E NTRY_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hU:N399C:_3D:E Vnote: Rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'E DOP_TY P399E | ( M imns,c cilnRtu3n2I_ntt,e rfparlestee)r;< t y| p ^e , Fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hc:#402#:d3e:v rnote: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd op ,m sPcrcoltRouLnLI,n tfeurlplrOeptse>r(, ProtoLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h1:21658:,33 :f unote: luninitialized use occurs herel Ops>(co m165m | , a l gcoo,p ywToorSkh)m;e m\8 ( t| i ^d %WA/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hR:P165_:S33I:Z Enote: ,uninitialized use occurs here dst, 165s | r c , bcyotpeysT)o;S h m| e ^~~m 8(tid%WARP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:S162I:Z5E:, warning: dvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]s t, s162r | c , b ydteefsa)u;l t :| ^~~ | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::162165::5:33 :warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]note: uninitialized use occurs here 162 | 165 | d e f acuolpty:T o S| h ^~~~~~~m em/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h8:(165t:i33d:% Wnote: Auninitialized use occurs hereR P_S I165Z | E , d scto,p ysTrocS,h mbeymt8e(st)i;d % W| A ^~~R P_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h*:d134s:t14,: *note: sinitialize the variable 'dst' to silence this warningr c; | ^134 | | = nullptr void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives514, | 1 , Pirnott oo,f f0s>e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | w a rcpoIpnyBTlooSchkm(etmh8r(etaiddI%dWxA.RxP/_WSAIRZPE_,S IdZsEt),, s r| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, b| y warp(tid/WARP_SIZEt es); | ^~~ 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~134 : 14| : group(group note: initialize the variable 'dst' to silence this warning 134 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 217 : 57v:o inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here *dst, 217* | s r cP;r i m| i ^t i v| e = nullptrs , 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 154165: | 10 : warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154c | o p y T ocSahsmee m38:( t i| d ^% WARP_SIZ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cppE:,5 :d9s:t ,note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested heres rc, 5b | y t e s ) ; | M ^~~S CCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::33134:: 14note: :uninitialized use occurs here note: initialize the variable 'dst' to silence this warning 165 | 134 | c o pvyoTiodS h*mdesmt8,( t*isdr%cW;A R P| _ ^S I Z| E = nullptr, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hZ:E134):,14 :w anote: rinitialize the variable 'dst' to silence this warningp (ti d134/ | W A R P _vSoIiZdE )*,d st, *| s ~~~~~~~~~~~~~~~~~~r c ;| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) | ^ 508| | = nullptr warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp::2171:: 57In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :note: 13in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h: 169217: | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h : 509P:r29i:m iwarning: tfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]i vess,( n1t,h rPeraodtso),, 0w>i dp(rtiimds% W A| R ^P _SIZE)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp,: 5w:a9r:p (note: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herei d/WA R5P | _ S I Z E ) , M| S ~~~~~~~~~~~~~~~~~~C C L| _ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)I MPL _508K | E R N E Lw_aErNpTIRnYB_lFoUcNkC(_tDhErVeRaEdDIOdPx_.TxY/PWEA(RMPi_nS,I ZuEi)n,t 3 2| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t , | f warp(tid/WARP_SIZEa lse) ;509 | | ^ flag/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hT:h402r:e3a:d (note: (expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE't id%4) =402= | 3 ) ,m sgcrcoluRpu(ngIrnotuepr)p,re t e| r ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~< t y| p warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3e , Fun c510# | # d e v rsetdeoppSc,c lPSrhomteomL.Lc1o2m8m,. bfuuflflSOipzse>s([cNoCmCmL,_ PaRlOgToO,_ LwLo1r2k8)];/ N\C C L| _ ^S TEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp :f5u:l9l:O pnote: sin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here> (comm ,5 | a l g o , w o rMkS)C;C L\_ I M| P ^L _KE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hR:N165E:L33_:E Nnote: Tuninitialized use occurs hereR Y_FUN C165_ | D E V R EcDoOpPy_TToYSPhEm(eMmi8n(,t iudi%nWtA3R2P__tS,I ZfEa,l sdes)t;, s| r ^c , by/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:e399s:)3;: note: | expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h399: | 162 : 5m:s cwarning: cvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]l Ru n162I | n t e r pdreeftaeurl ,165 | P r o t ocLoLp,y TfouSlhlmOepms8>((tciodm%mW,A RaPl_gSoI,Z Ew,o rdks)t;, \s r c| , ^ byte/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:)165;: 33 :| ^~~note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hZ:E134,: 14d:s tnote: ,initialize the variable 'dst' to silence this warning src, 134b | y t e s )v;o i d| ^~~* dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ha:l134s:e14):; note: initialize the variable 'dst' to silence this warning| ^ 134/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 405 : 3 :v onote: iexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd *dst, *405s | r c ;m s c| c ^l R u| n = nullptrI nterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | cIn file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cppp:y1T: oIn file included from S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hh:m13e: mIn file included from 8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h(:t167i: d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h%:W562A:R15P:_ Swarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]Z E, dst, s r562c | , b y tteisd)(;t i d| ) ^~~, nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:a162d:s5(:n twarning: hvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]r ea d162s | ) , t iddeIfnaBullotc:k ( t| h ^~~~~~~r ea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:I165d:x33.:x )note: ,uninitialized use occurs here grou p165( | g r o ucpo)p,y T o| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h m e| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)8 (tid %563W | A R P _ SsItZeEp,S idzset(,n cscrlcS,h mbeymt.ecso)m;m . b| u ^~~f fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp134::514::9 :note: initialize the variable 'dst' to silence this warningnote: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 134 | 5 | v o i d *MdSsCtC,L _*IsMrPcL;_ K E| R ^N E L| _ = nullptrE NTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple562,: 15f:u lwarning: linitializer order does not match the declaration order [-Wreorder-ctor]O ps>(comm ,562 | a l g o ,t iwdo(rtki)d;) ,\ n t| h ^r eads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock') , tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs: 154 :| 10 ^: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp: 5154: | 9 : note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herec ase 35: | | ^ MS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cppC:C5L:_9I:M Pnote: Lin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here_ KERNE L5_ | E N T R Y _ F U NMCS_CDCELV_RIEMDPOLP__KTEYRPNEE(LM_iEnN,T RuYi_nFtU3N2C__tD,E VfRaElDsOeP)_;T Y P| E ^( Min, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:i405n:t33:2 _note: texpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE', false )405; | | m ^s cclR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:n402I:n3t:e rnote: pexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'r eter< t402y | p e ,m sFcucnlcR#u#ndIenvtreerdporpey,p eP,r oFtuonSci#m#pdleevK,S TPErPoSt/oMLSLC1C2L8_,S LfIuClElSOTpEsP>S(,c oMmSmC,C La_lSgLoI,C work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ESTEPS>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | vo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:d154 :*10d:s twarning: ,variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] *src; 154| | ^ | = nullptrc ase 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: In file included from variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp :1 : 162In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 13 : In file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.he:f167a: u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:t562:: 15 :| ^~~~~~~warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 562 | 165 | t i d ( tciodp)y,T onSthhmreema8d(st(indt%hWrAeRaPd_sS)I,Z Et,i ddIsntB,l oscrkc(,t hbryetaedsI)d;x . x| ) ^~~, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :P134r:i14m:i tnote: iinitialize the variable 'dst' to silence this warningv es , | 1 = nullptr, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreIn file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppe:r1<: tIn file included from y/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:e13,: In file included from F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hu:n169c: #/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h#:d509e:v29r:e dwarning: ofield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]p , P507r | o t o S itmipdl(etd,/ WfAuRlPl_OSpIsZ>E()c,o m m| , ~~~~~~~~~~~~~~~~~~ a l| g stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)o , wo r508k | ) ; \ w a| r ^p InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx.x /562W | A R P _ StIiZdE()t,i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h warp(tid/WARP_SIZEr eads (509n | t h r e afdlsa)g,T htriedaIdn(B(ltoicdk%(4t)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h 134 | : 134v:o14i:d note: *initialize the variable 'dst' to silence this warningd st, *src; 134| | ^ | = nullptrv oid *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppd:)1,: In file included from n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:h13r: eIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:s167(: n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d swarning: )initializer order does not match the declaration order [-Wreorder-ctor], tidInBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i dInBl o563c | k ( t h rsetaedpISdixz.ex()n,c cglrSohumpe(mg.rcooumpm).,b u f| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S i z| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s [NCCL _563P | R O T O _sStIeMpPSLiEz]e/(NnCcCcLl_SShTmEePmS./csoimzme.obfu(fTf)S)i z{e s [| N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C C L| _ group(groupP ROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:L217E:]57/:N Cnote: Cin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested hereL _STE P217S | / s iPzreiomfi(tTi)v)e s{< T ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R e d| O group(groupp , FanAsy/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:m217e:t57r:i cnote: , FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here1 ,1>, 2171 | , PPrroitmoi,t i0v>e sp, ProtoSimple<2, 2>, false>' requested herei c<1, 15> | , 1 , P r o tMoS,C C0L>_ IpMrPiLm_sK E R| N ^E L_ENTR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppY:_5F:U9N:C _note: Din instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereE VRED O5P | _ T Y P E ( M i nM,S CiCnLt_6I4M_PtL,_ KfEaRlNsEeL)_;E N T| R ^Y _FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hN:C405_:D3E:V Rnote: Eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'D OP_TYP E405( | M i nm,s cicnltR6u4n_Itn,t efraplrseet)e;r < t| y ^p e, Fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:c405#:#3d:e vnote: rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'e dop | , PmrsoctcolSRiumnpIlnet ,M SPCrCoLt_oSSLiImCpElSeTC,L _fCuHlUlNOKpSsT>E(PcSo/mMmS,C CaLl_gSoL,I CwEoSrTkE)P;S ,\ M S| C ^C L_SL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:C562E:S15T:E Pnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'> , fu l562l | O p s > (tciodm(mt,i da)l,g on,t hwroerakd)s;( n\t h r| e ^a ds), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I15n:B lnote: ofield 'nthreads' will be initialized after field 'tidInBlock'c k(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~h re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)60,: tnote: ifield 'group' will be initialized after field 'stepSize'd InBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~~~~~~~, tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~( nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ : warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | war/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:I154n:B10l:o cwarning: kvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]( threa d154I | d x . x /cWaAsReP _3S:I Z E| ) ^, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp :5: 9509: | note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here flagT h5r | e a d ( ( t i d %M4S)C=C=L3_)I,M PgLr_oKuEpR(NgErLo_uEpN)T,R Y _| F ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~U N C| _ warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3D EVRE D510O | P _ T Y PsEt(eMpiSni,z e(ncclShhamlefm,. cfoamlms.eb)u;f f S| i ^z es[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hL:_402P:R3O:T Onote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'L L128]/N C402C | L _ SmTsEcPcSl/RsuinzIenotfe(rupirnett6e4r_note: ,in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here Pro t217o | L L 1P2r8i,m iftuilvleOsp,( cRoemdmO,p ,a lFgaon,A swyomrmke)t;r i\c < 1| , ^1 >, 1,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :P165r:o33t:o ,note: uninitialized use occurs here0 > prim s165 | | ^ co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cppp:y5T:o9S:h mnote: ein instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested herem 8(t i5d | % W A R P _ S I ZMES,C CdLs_tI,M PsLr_cK,E RbNyEtLe_sE)N;T R Y| _ ^~~F UNC_DEVR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:D162O:P5_:T Ywarning: Pvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]E (Min ,162 | h a l f ,d effaaluslet):; | | ^~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::402165::333:: note: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'uninitialized use occurs here 402165 | | m s cccolpRyuTnoISnhtmeermp8r(ettiedr%),; P r| o ^~~t oLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rotoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cppe:m18: (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht:i154d:%10W:A Rwarning: Pvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]_ SIZ E154, | d s t ,c assrec ,3 :b y t| e ^s ); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here162 :5: warning: 5variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] | 162 | M SdCeCfLa_uIlMtP:L _ K| E ^~~~~~~R NEL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:E165N:T33R:Y _note: Funinitialized use occurs hereU NC_DE V165R | E D O P _cToYpPyET(oMSihnm,e mf8l(otaitd,% WfAaRlPs_eS)I;Z E ,| ^d st,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :s399r:c3,: bnote: yexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE't es); | 399 ^~~ | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 134 :| 14 ^~~~~~~: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hnote: :initialize the variable 'dst' to silence this warning165 :33: note: 134uninitialized use occurs here | v o165i | d * d scto,p y*TsorSch;m e m| 8 ^( t i| d = nullptr% WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | dIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cppf:a1u: l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ht::154 : 10| : ^~~~~~~ warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]: 165:33: note: 154uninitialized use occurs here | 165c | a s e 3c:o p y| T ^o Shmem8(tid%/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cppW:A5R:P9_:S Inote: Zin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested hereE , dst ,5 | s r c , b y t eMsS)C;C L _| I ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr MPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZEIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr , dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/clang++ -fPIC -pipe -frecord-gcc-switches -Wall -g -O2 -parallel-jobs=16 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.1.40093 --hip-link --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 /usr/lib/llvm-rocm/lib64/clang/17/lib/linux/libclang_rt.builtins-x86_64.a -lpthread -lrt -ldl Elapsed time (seconds): 451.786 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Built target rccl gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.62946 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/rccl-buildroot + : + /bin/rm -rf -- /usr/src/tmp/rccl-buildroot + PATH=/usr/libexec/rpm-build:/usr/src/bin:/usr/bin:/bin:/usr/local/bin:/usr/games + cd rccl-2.18.6 + DESTDIR=/usr/src/tmp/rccl-buildroot + cmake --install x86_64-alt-linux --verbose -- Install configuration: "" -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1.0 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-allpairs-16n-16tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-16tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-1pass.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets-noconfig.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl/LICENSE.txt + rm -rf /usr/src/tmp/rccl-buildroot/usr/rccl + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/rccl-buildroot (auto) mode of './usr/lib64/librccl.so.1.0' changed from 0755 (rwxr-xr-x) to 0644 (rw-r--r--) Verifying and fixing files in /usr/src/tmp/rccl-buildroot (binconfig,pkgconfig,libtool,desktop,gnuconfig) Checking contents of files in /usr/src/tmp/rccl-buildroot/ (default) Compressing files in /usr/src/tmp/rccl-buildroot (auto) Adjusting library links in /usr/src/tmp/rccl-buildroot ./usr/lib64: (from :0) librccl.so.1 -> librccl.so.1.0 Verifying ELF objects in /usr/src/tmp/rccl-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) section [ 3] '.dynsym': symbol 338 (__hip_fatbin): symbol in dynamic symbol table with non-default visibility verify-elf: WARNING: ./usr/lib64/librccl.so.1.0: eu-elflint failed Splitting links to aliased files under /{,s}bin in /usr/src/tmp/rccl-buildroot Processing files: librccl1-2.18.6-alt0.1 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.54838 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + DOCDIR=/usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + export DOCDIR + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + /bin/mkdir -p /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + cp -prL README.md LICENSE.txt NOTICES.txt CHANGELOG.md /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R go-w /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R a+rX /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.rJSLpz find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) lib.prov: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1: 192 symbols, 18 bpp Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.5AlLG1 find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) warning: librccl1 provides another subpackage: rccl Provides: rccl = 2.18.6-alt0.1, librccl.so.1()(64bit) = set:ldySY8WxOALBnhFpKYr8hTuOp4f4mGu2jLdMJjcZCXM47UXuwyyGRGWXKgETcgdjMi5wuDQ3qOxtZBm81J7pYPMIUZa5VdctQkKefUrjndPqhuFfak8KACxDBZ2WZJDfvJzZ89VmVuIkNiinUuRvWX09AlpiViW0mDiqb8i3YJossrximfgU5FDIg3bfAM3p87RAKcG4MZinBzsSGNgsBCROo9k0v79172vNT21EO938Mcw8TzCb018bhHvvzgmTvhhNQWFQoI4SSRedfYZyMcS4HABqmacW4xzCUZaO5x9LSUxVFl0qy5C7FFGgAn04Hyxww4hPwz6LsL4UDEnEe2dpGZx29zB56rIHYGcZG1BqjQafIX1WE3sbDhXCpfBjMq4 Requires: ld-linux-x86-64.so.2()(64bit) >= set:jiids, ld-linux-x86-64.so.2(GLIBC_2.3)(64bit), libamdhip64.so.6()(64bit) >= set:mgEl4iHah5shPP2z5A5zYttYI7XpZyRnhe1J6ZgwULwPlWeYZ4XbZd2bItRMqeW4hZmmUYmDZdpDnrYqkUKOuzfUwKzIyQItN97gggSsa6v6KYBa3m70aJ49gh1ckMQcuEPMZKgWZw, libamdhip64.so.6(hip_4.2)(64bit), libamdhip64.so.6(hip_4.3)(64bit), libamdhip64.so.6(hip_4.5)(64bit), libamdhip64.so.6(hip_5.0)(64bit), libamdhip64.so.6(hip_5.3)(64bit), libamdhip64.so.6(hip_6.0)(64bit), libc.so.6(GLIBC_2.14)(64bit), libc.so.6(GLIBC_2.17)(64bit), libc.so.6(GLIBC_2.2.5)(64bit), libc.so.6(GLIBC_2.3)(64bit), libc.so.6(GLIBC_2.3.2)(64bit), libc.so.6(GLIBC_2.3.4)(64bit), libc.so.6(GLIBC_2.33)(64bit), libc.so.6(GLIBC_2.34)(64bit), libc.so.6(GLIBC_2.38)(64bit), libc.so.6(GLIBC_2.6)(64bit), libgcc_s.so.1(GCC_3.0)(64bit), libm.so.6(GLIBC_2.2.5)(64bit), librocm_smi64.so.1()(64bit) >= set:miSwa9ZECgdMsH9hGiyEU5mNQ1, libstdc++.so.6(CXXABI_1.3)(64bit), libstdc++.so.6(CXXABI_1.3.5)(64bit), libstdc++.so.6(CXXABI_1.3.7)(64bit), libstdc++.so.6(GLIBCXX_3.4)(64bit), libstdc++.so.6(GLIBCXX_3.4.11)(64bit), libstdc++.so.6(GLIBCXX_3.4.18)(64bit), libstdc++.so.6(GLIBCXX_3.4.19)(64bit), libstdc++.so.6(GLIBCXX_3.4.21)(64bit), libstdc++.so.6(GLIBCXX_3.4.22)(64bit), libstdc++.so.6(GLIBCXX_3.4.29)(64bit), rtld(GNU_HASH) Requires(rpmlib): rpmlib(SetVersions) Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.Mq7CgW Creating librccl1-debuginfo package Processing files: librccl-devel-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.wjlI26 find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.Q4c1f4 find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:12: /usr/include/hip/hip_runtime.h:66:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 66 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:70: /usr/include/hip/hip_runtime_api.h:8852:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 8852 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:71: /usr/include/hip/library_types.h:75:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 75 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:73: /usr/include/hip/hip_vector_types.h:38:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 38 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:13: /usr/include/hip/hip_fp16.h:33:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 33 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ cpp.req: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed, trying c++ mode x86_64-alt-linux-cpp: fatal error: cannot execute 'cc1plus': execvp: No such file or directory compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h:10:10: fatal error: nccl.h: No such file or directory 10 | #include "nccl.h" | ^~~~~~~~ compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h: cpp failed Provides: rccl-devel = 2.18.6-alt0.1 Requires: /usr/lib64/librccl.so.1 Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.CyNLMH Processing files: librccl1-debuginfo-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.LPTUg8 find-provides: running scripts (debuginfo) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.oc2iHB find-requires: running scripts (debuginfo) Provides: debug64(librccl.so.1) Requires: librccl1 = 2.18.6-alt0.1, debug64(ld-linux-x86-64.so.2), debug64(libamdhip64.so.6), debug64(libc.so.6), debug64(libgcc_s.so.1), debug64(libm.so.6), debug64(librocm_smi64.so.1), debug64(libstdc++.so.6) Adding to librccl1-debuginfo a strict dependency on librccl1 Adding to librccl-devel a strict dependency on librccl1 Removing 1 extra deps from librccl-devel due to dependency on librccl1 Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-2.18.6-alt0.1.x86_64.rpm (w2T16.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl-devel-2.18.6-alt0.1.x86_64.rpm (w2T16.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm (w2.lzdio) 18986.72user 777.19system 23:42.56elapsed 1389%CPU (0avgtext+0avgdata 5528648maxresident)k 4568inputs+0outputs (103major+87106409minor)pagefaults 0swaps /.out/librccl1-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl-devel-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // 7.58user 5.86system 25:20.29elapsed 0%CPU (0avgtext+0avgdata 136564maxresident)k 2553552inputs+0outputs (0major+336888minor)pagefaults 0swaps